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Foreword 


General Relativity (GR) is founded on the observation of Mercury perihelion preces- 
sion anomaly discovered by Le Verrier and improved by Newcomb, the Michelson— 
Morley experiment, and the precision Edtv6s experiment. Its theoretical basis is 
based on Special Relativity (previously called the restricted theory of relativity), 
Einstein Equivalence Principle (EEP) and the realization that the metric is the 
dynamical quantity of gravity together with the Principle of General Covariance 
and the absence of other dynamical quantities. The establishment of GR in 1915 is 
a community effort with Albert Einstein clearly playing the dominant role. For one 
hundred years, its applicability through solar system to cosmology is prevailing. 
If one includes the cosmological constant (proposed in 1917 by Einstein) in GR, 
there have not been any fully established non-applicable places. The only possible 
potential exception is the missing mass (dark matter)-deficient acceleration issue. 
Dark energy and quantum gravity are needed in the present theoretical foundation 
of physics; however, more experimental clues are needed. The framework applica- 
bility of GR is already demonstrated in theoretical inflation models with quantum 
fluctuations leading to structure formation with experimentally observed spectrum. 

To celebrate the GR centennial, we solicit the writing of 23 chapters in these 
two volumes consisting of five parts: 


Part I. Genesis, Solutions and Energy. 
Part II. Empirical Foundations. 
Part III. Gravitational Waves. 
Part IV. Cosmology. 

Part V. Quantum Gravity. 


Volume 1 consists of Part I, Part II and Part III; Volume 2 consists of Part IV and 
Part V. 

In Part I, Valerie Messager and Christophe Letellier start in Chapter 1 with a 
genesis of special relativity to set the stage. They rely on the original literature to 
make the development clear and connected. Thanks to many thorough researches 
in the last 50 years, the path to general relativity is clear. A concise exposition of 
the path is presented in Chapter 2. In Chapter 3, Christian Heinicke and Friedrich 
Hehl present the historical development and detailed properties of the basic and 
fundamental spherical Schwarzschild and axisymmetric Kerr solutions. In Chap- 
ter 4, Chiang-Mei Chen, James Nester and Roh-Sung Tung expound the important 
and useful concept of energy with its many facets and various applications. 
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In Part II, the empirical foundations of GR are examined. First, the corner- 
stone Einstein Equivalence Principle (EEP) is explored. Ever since about 100 ps 
(the time of electroweak phase transition or the equivalent/substitute) at the quasi- 
equilibrium Higgs/intermediate boson energy scale from the Big Bang (or equiva- 
lent /substitute), photons and charged particles are abundant. With the premetric 
formulation of electrodynamics, we examine the tests of EEP via the metric-induced 
spacetime constitutive tensor density. The non-birefringence of the cosmic electro- 
magnetic wave propagation in spacetime is observed to ultrahigh precision. This 
constrains the spacetime constitutive tensor density to Maxwell—Lorentz (metric) 
form plus a scalar (dilaton) degree of freedom and a pseudoscalar (axion) degree 
of freedom to high precision. The accurate agreement of cosmic microwave back- 
ground spectrum with the Planck spectrum constrains the fractional change of the 
cosmic dilaton to be less than 8 x 1074. The Galileo weak equivalence principle 
(WEP) experiments (Edtvés-type experiments) constrain the fractional dilatonic 
change in the solar system to be less than 10~!°. Accompanying the axion degree 
of freedom is the rotation of linear polarization in the cosmic propagation of elec- 
tromagnetic waves called cosmic polarization rotation (CPR). Sperello di Serego 
Alighieri reviews the constraints from radio galaxy observations and CMB polar- 
ization observations to give a general constraint of 0.02 rad for the mean (uniform) 
CPR and also a constraint of 0.02 rad for the CPR fluctuations. In many inflation 
models dilatons and axions play important roles; these investigations are crucial 
to give clues or constraints on the models. Frequency and time are the most pre- 
cise metrological quantities. Their uses in gravity experiments are unavoidable. The 
use of GR in time synchronization and in GPS, GLONNESS, Galileo and Beidou 
becomes a folk talk. There are two good ways to compare precision clocks: (i) fiber 
links; (ii) space optical links using laser ranging. Etienne Samain expounds the space 
optical link approach and addresses the laser ranging missions T2L2 (Time trans- 
fer by Laser link), LRO (Lunar Reconnaissance Orbiter) and LTT (Laser Time 
Transfer) together with future space mission proposals for fundamental physics, 
solar system science/navigation in which laser links are of prime importance. Solar- 
system observation provides the original impetus and the first confirmation of GR. 
Chapter 8 summarizes the progress of classical solar system tests and explores its 
potential in the future. Improvement of three or more orders of magnitude is still 
possible. 

Perhaps the most dramatic development in testing relativistic gravity and in 
improving the dynamical foundations of general relativity is the discovery and obser- 
vation of pulsars, binary pulsars, millisecond pulsars and double pulsars since 1967, 
1974, 1982 and 2003 respectively. Richard Manchester reviews the pulsar observa- 
tion in its relation with gravity in Chapter 9 with a brief introduction to basic pulsar 
properties and pulsar timing. He presents a rather thorough account of dynamical 
tests of GR and the strong equivalence principle together with a lucid but in-depth 
account of GW detection using pulsar timing arrays (PTAs). See front cover for an 
illustrative schematic of a PTA. 
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In 1916 Einstein predicted gravitational waves (GWs) in GR almost immedi- 
ately after his founding of it. The existence of gravitational waves is the direct 
consequence of general relativity and unavoidable consequences of all relativistic 
gravity theories with finite velocity of propagation. Their importance in GR is like 
that electromagnetic waves in Maxwell—Lorentz theory of electromagnetism. Ein- 
stein’s general relativity and relativistic gravity theories predict the existence of 
gravitational waves. Gravitational waves propagate in spacetime forming ripples 
of spacetime geometry. In the introductory chapter of Part III, Kazuaki Kuroda, 
Wei-Ping Pan and I review and summarize the complete GW spectrum, the meth- 
ods of detection, and the detection sensitivities in various frequency bands with 
a brief introduction to GW sources. At the time Einstein predicted GWs in GR, 
he estimated that GWs were experimentally not detectable due to feeble strengths. 
However, thanks to one hundred years of development of experimental methods and 
technology together with the discovery of various astrophysical compact objects 
and cosmological sources, GWs are now on the verge of detection in three fre- 
quency bands. The very low frequency band (10fHz-300pHz) GWs are on the 
verge of detection by the PTAs; Richard Manchester covers this part in his chapter 
on pulsars and gravity in Part II]. As mentioned in Chapter 10, the observation of 
PTAs has already constrained the isotropic GW background to a level excluding 
most current models of supermassive black hole formation. This is a strong signal 
that PTA observation is on the verge of detecting GWs. The high frequency band 
(10 Hz-100kHz) GWs are on the verge of detection by ground-based interferome- 
ters; Kazuaki Kuroda addresses the detection methods and the sources in the sec- 
ond chapter of Part III. The extremely low (Hubble) frequency band (1 aHz-10 fHz) 
GWs may also be on the verge of detection by CMB polarization observations; the 
present status is briefly reviewed in the introduction chapter of Part III. The low 
frequency band (100 nHz-100mHz) and the middle frequency band detections will 
have the greatest S/N ratios according to the present expectation. We review the 
sources, goal sensitivities, various mission proposals together with the current sup- 
porting activities in the third chapter of Part III. The GW quadrupole radiation 
formula has already been verified by the binary pulsar observations. In the next 
hundred years we will see great discoveries and immense focused activities toward 
the establishment and flourish of GW astronomy and GW cosmology. GW physics 
and GW astronomy will become a precision discipline in the coming century. 

The development of cosmology is most dramatic during the last hundred years. 
From Kapteyn universe in 1915 of observed disk star system of 10 kpc diameter and 
2kpc thickness with the Sun near its center to full-fledged precision cosmology now 
is monumental in the human history. It is fortunate that the development of obser- 
vational cosmology has GR theory as a theoretical basis and goes hand-in-hand 
with the development of general relativity. This is fortunate both for observational 
cosmology and for GR. Using the Cosmological Principle Einstein looked into cos- 
mological solutions in GR in 1917. The fast development of observational distance 
ladder around that time soon extends the reach of astronomy to modern cosmos. 
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Studies in the fundamental issues on the origins of cosmos lead to anthropological 
principle, cosmic inflation, and cosmic landscape scenarios. The cosmos is believed 
to be open in (extended beyond) the Hubble distance scale. Part HI consists of 
seven chapters: Martin Bucher and I present some introductory remarks with a 
discussion of missing mass-deficient acceleration issue in the first chapter; Marc 
Davis reviews the observation and evolution of cosmic structure; Martin Bucher 
give a rather comprehensive exposition of the physics (almost on every aspect of 
cosmology) of CMB; Xiangcun Meng, Yan Gao and Zhanwen Han review the SNla 
as a standardizable distance candle, its nature, its progenitors and its role in the 
cosmology together with related current issues; Toshifumi Futamase on the gravi- 
tational lensing in cosmology; K. Sato and Juni’ichi on cosmic inflation with a brief 
historical exposition on the development in Japan and Russia; David Chernoff and 
Henry Tye on inflation and cosmic strings from the point of view of string theory. 

The quest for a satisfactory quantum description of gravity began very early. 
Einstein thought that quantum effects must modify general relativity in his first 
paper on GWs in 1916. Klein argued that the quantum theory must ultimately 
modify the role of spatiotemporal concepts in fundamental physics in 1927. Part V 
on Quantum Gravity consists of 4 chapters. Chapter 20 gives a bird’s-eye survey on 
the development of fundamental ideas of quantum gravity together with possible 
observations of quantum gravitational effects in the foreseeable future. The classi- 
cal age (1958-1969; according to the chronological classification of Rovelli) started 
with ADM canonical formalism and concluded with DeWitt—Wheeler equation and 
DeWitt’s derivation of Feynman rules for perturbative GR. In the middle ages 
(1970-1983), the discovery of black hole thermodynamics and Hawking’s derivation 
of black hole radiation radically affected our understanding of general relativity. In 
the renaissance period (1984-1994), there are two influential developments. From 
the covariant approach, attempts to get rid of infinities merge into string theory. The 
use of strings and branes extends the theoretical framework of quantum field the- 
ory. From the canonical approach, background-independent loop quantum gravity 
emerged 20 years after DeWitt—Wheeler equation. In Chapter 21, Richard Woodard 
starts with experiences of two personal academic careers through the classical and 
middle ages, advocates that the cosmological data from the epoch of primordial 
inflation is catalyzing the maturation of quantum gravity from speculation into a 
hard science, explains why quantum gravitational effects from primordial inflation 
are observable, reviews what has been done in perturbative quantum gravity, tells 
us what the future holds both theoretically and observationally, and discusses what 
this tells us about quantum gravity. In Chapter 22, Steven Carlip reviews the discov- 
ery of black hole thermodynamics and summarizes the many independent ways of 
obtaining the thermodynamic and statistical mechanical properties of black holes. 
This has offered us some early hints about the nature of quantum gravity. Steven 
then describes some of the remaining puzzles, including the nature of the quantum 
microstates, the problem of universality, and the information loss paradox. In the 
last chapter, Dah-Wei Chiou gives us a rather self-contained introductory review 
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on loop quantum gravity — a background-independent nonperturbative approach 
to a consistent quantum theory of gravity placing emphasis on the fundamental 
ideas and their significance. The review presents the canonical formulation of loop 
quantum gravity as the central topic and covers briefly the spin foam theory, the 
relation to black hole thermodynamics and the loop quantum cosmology with cur- 
rent directions and open issues summarized. 

Although we do not yet have a consistent calculable quantum gravity theory 
which has a good degree of completeness like quantum electrodynamics or quan- 
tum chromodynamics, the efforts to find one already led to the consistent renormal- 
ization of the gauge theory in 1960’s. The new development since 1980’s together 
with more understanding and further development of perturbation theory may give 
clues to a consistent theory. During these endeavors, the quest for a well-developed 
quantum gravity phenomenology including the quest to find a correct inflationary 
(or non-inflationary) scenario may play a significant role. 

The hope is that we will have one within a generation. This book is written and 
assembled for graduate students and general scientific-oriented readers alike. Each 
chapter is basically a review article. The five Parts are interconnected. Different 
combinations can be designed for special topics for graduate students and advanced 
undergraduates. For example, following combinations are suitable for each topic 
named: 


(i) Basics (Selected Topics in GR): Part I, Chapters 8, 9, 10, 13, 20; 
(ii) Empirical Foundations (Empirical Foundations of Relativistic Gravity): Chap- 
ter 2, Part II, Chapters 10, 11, 18, 14, 16, 20; 
(iii) Gravitational Waves: Chapters 2, 9, Part III, Chapters 15, 18, 19; 
(iv) Cosmology: Chapters 5, 6, 10, 12, Part IV, Chapter 20; 
(v) Quantum Gravity: Chapters 3, 4, 10, 18, 19, Part V. 


There can be various other combinations too. 

We are grateful to all contributors for agreeing to write comprehensive reviews 
to make this publication possible. We would also like to thank all the referees for 
their valuable comments and suggestions: Martin Bucher, Stephen Carlip, Dah- 
Wei Chiou, Sperello di Serego Alighieri, Angela Di Virgilio, John Eldridge, Jeremy 
Gray, Friedrich Hehl, Jim Hough, Ekaterina Koptelova, Ettore Majorana, James 
Nester, Ulrich Schreiber, Alexei Starobinsky, David Tanner, Richard Woodard, An- 
Ming Wu, Masahide Yamaguchi. We thank the World Scientific staff, especially 
Dr. K. K. Phua and Kah Fee Ng for their generous support in completing the book. 

We dedicate this two-volume GR centennial book to the founders of GR and 
various communities who have contributed to this dramatic century of development 
and applications of GR. 


Wei-Tou Ni 
November, 2015 
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Note Added in Proof 


After the foreword was written, LIGO Scientific and Virgo Collaborations 
announced in February 2016 and in June 2016 the first direct detections of gravita- 
tional waves (GWs) by LIGO Hanford and LIGO Livingston detectors in September 
2015 and in December 2015. With the LIGO discovery announcements, two impor- 
tant things are verified: (i) GWs are directly detected in the solar-system; (ii) Black 
holes (BHs), binary BHs and BH coalescences are discovered and measured exper- 
imentally and directly with the distances reached more than 1 billion light years. 
These discoveries constitute the best celebration of the centennial of the genesis of 
general relativity. We refer the readers to Refs. 1 and 2 for the discovery and Refs. 
3 and 4 for a brief history of gravitational wave research. 

A web page will be set up for updates of the reviews of these two volumes. 
Please see http://astrod.wikispaces.com/ for announcement. 
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This year marks the 100th anniversary of Einstein’s 1915 landmark paper “Die Feldgle- 
ichungen der Gravitation” in which the field equations of general relativity were correctly 
formulated for the first time, thus rendering general relativity a complete theory. Over 
the subsequent hundred years, physicists and astronomers have struggled with uncover- 
ing the consequences and applications of these equations. This paper, which was written 
as an introduction to six chapters dealing with the connection between general relativity 
and cosmology that will appear in the two-volume book One Hundred Years of General 
Relativity: From Genesis and Empirical Foundations to Gravitational Waves, Cosmol- 
ogy and Quantum Gravity, endeavors to provide a historical overview of the connection 
between general relativity and cosmology, two areas whose development has been closely 
intertwined. 


Keywords: General relativity; cosmology; centennial connection. 


PACS Number(s): 04, 98.80.—k 


One hundred years ago, the best model of the universe summarizing the state 
of the observations at that time was the Kapteyn universe, which consists of a 
system of stars distributed more or less uniformly within a disk about 10 kpe in 
diameter and 2kpc thick. In this model, the Sun is situated near the center of 
the disk. In 1917, Einstein! proposed a static cosmological model based on the 
“cosmological principle,” a generalization of the Copernican principle postulating 
that the homogeneity and isotropy of space in the large should be extended to 
include the time dimension as well. Using distance determinations to about 100 
globular clusters, Shapley in 1918 pushed back the boundaries of the measured 
universe and concluded that the Sun lies near the edge of this distribution. As a 
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result of rapid improvements in astronomical observations, the size of our observed 
universe soon grew to extend almost to our causal horizon. 

This next group of chapters comprising Part IV of GR100 book deals with 
Cosmology, applying Einstein’s theory of general relativity to the universe as a 
whole. Cosmology considers the universe on very large scales and evolving over 
very long times, comparable to the age of the universe. Cosmology today is often 
described as being a “precision” science and the insistence on this term reflects that 
cosmology had not always been seen as such. When Einstein formulated his general 
theory of relativity, little was known about the universe beyond our own galaxy, 
and although distant galaxies had been observed as “nebulous” unresolved spiral 
blotches, it was not at all clear that these “nebulae” consisted of numerous stars 
much like our own galaxy. 

A glimpse of the state of affairs slightly after the formulation of general relativity 
may be gained by reading the written summaries” of the “Great Debate” held 
at the Smithsonian institution in 1920, where Harlow Shapley and Heber Curtis 
sparred over the question of the “scale of the universe.” The former argued that 
the “nebulae” were simply clouds lying on the periphery of our own galaxy, whereas 
the latter maintained that the observations at the time suggested that the “nebulae” 
were distant “island universes” much like our own galaxy. A key observation that 
helped settle this question in favor of the latter point of view was the discovery by 
Edwin Hubble of Cepheid variable stars in the Andromeda galaxy, which established 
that Andromeda lies far beyond the confines of our own galaxy. 

On the observational side, as telescopes and other observational techniques 
greatly improved, our view of the universe progressively expanded to greater and 
greater distances. Once it had been established that the universe was populated by 
galaxies of different sizes, a recurrent question became whether the universe was 
homogeneous and isotropic on the largest observable scales, and it is only recently 
that galaxy surveys sufficiently deep and with sufficient statistics became available 
to settle this question definitively. An excellent historical account of these early 
debates and their role of the development of modern cosmology can be found in 
Peebles’ 1980 book? (see also Ref. 4). 

The theory underlying the hot big bang model as we know it today was 
developed before the observational issues mentioned above had been settled. The 
geometry and time evolution of the universe as predicted by Einstein’s theory 
are given by what is now known as the Friedmann-Lemaitre-Robertson—Walker 
(FLRW) model, which describes the solutions to Einstein’s field equations for a spa- 
tially homogeneous and isotropic universe whose scale factor varies with time. This 
solution to the Einstein field equations was first put forward by Alexander Fried- 
mann in 1922°° and later independently by Georges Lamaitre’ (see also Ref. 8). 
Robertson? and Walker!° subsequently showed that this was the only solution to 
the field equations consistent with spatial homogeneity and isotropy. As discussed 
above, in 1917 Einstein! had put forth a theory of a static universe — a solution 
of the general relativity field equations that is not only homogeneous and isotropic 
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in the three spatial dimensions but also homogeneous in time. Given the lack of 
compelling observational evidence to the contrary at the time, Einstein believed 
that an eternal universe, in which the Copernican principle held not only in three 
spatial dimensions but also in time, was more elegant and hence more plausible. 
In order to satisfy the gravitational field equations, Einstein had to introduce a 
cosmological constant term, denoted by A, a proposal that according to George 
Gamow, Einstein had later once described as his “greatest blunder,” although the 
authenticity of this quote is doubted by some. 

Of the following six chapters of this volume dealing with the connection between 
cosmology and general relativity, the first three chapters deal primarily with the 
observations underpinning our modern conception of the cosmos. The first chapter 
of this group “Cosmic structure” provides a comprehensive overview of galaxy sur- 
veys and the information provided by them regarding the large-scale distribution 
of mass in the universe.!! The second chapter discusses the physics of the 2.725 K 
cosmic microwave background (CMB) radiation.!? The CMB confirms the big bang 
story in two ways. First, the excellent agreement between the observed frequency 
spectrum of the microwave sky and a perfect blackbody spectrum at T’ = 2.725K 
provides strong confirmation of the expanding universe scenario, in which the uni- 
verse at early times was much smaller and hotter, and thus once in a state extremely 
close to thermal equilibrium. Second, the measurement of the small departures from 
homogeneity and isotropy confirms the gravitational instability hypothesis for the 
origin of structure. This chapter describes in detail how the precise mapping of the 
anisotropies of the blackbody temperature in both intensity and polarization can 
be used to test theories of the very early universe based on new physics far beyond 
the standard model and how these fluctuations provide the initial conditions for 
the subsequent evolution leading to the formation of structure. The third chapter 
deals with the SN Ia distance scale and the discovery of the accelerated expansion 
of the universe.!? The fourth chapter “Gravitational lensing in cosmology” details 
the present state of gravitational lensing probes.!+ The observation of the deflec- 
tion of light rays as predicted by general relativity of course dates back to the 
famous expedition led by Arthur Eddington in 1919 to the island Principé off the 
west coast of Africa in order to measure the bending of light by the Sun during a 
total eclipse. Since that celebrated and crucial experiment confirming the theory of 
general relativity, gravitational lensing has evolved into a powerful tool for mapping 
the inhomogeneity in the distribution of mass. 

As pointed out by Robert Dicke in the late 1960s,!°!° a universe that is gov- 
erned by general relativity (through the FLRW solution) and filled with ordi- 
nary matter (i.e., a combination of nonrelativistic and relativistic particles with 
w = p/p ranging from 0 to 1/3) provides a model that is unsatisfactory in several 
respects including (i) the so-called “horizon problem,” whereby distant regions in 
opposite directions in the sky appear similar but could never have been in causal 
contact subsequent to the apparent initial singularity!”1> and (ii) the “flatness 
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problem,” under which the relation between the expansion rate and mean den- 
sity at very early times usually expressed in terms of the dimensionless density 
parameter Q(t) = 87Gpmean(t)/3H7(t) initially had to be tuned near one with 
incredible precision to avoid a universe that today is either nearly empty or already 
re-collapsed.!>:16 

Although Dicke did not propose a concrete alternative, he suggested that the 
big bang model as formulated at the time might not be the whole story. “Cos- 
mic inflation,” reviewed in the fifth chapter,!® is a theory postulating that the very 
early universe underwent a period of quasi-exponential inflation during which what- 
ever inhomogeneities may have existed prior to inflation are effectively erased and 
replaced with quantum fluctuations of the quantum fields relevant during inflation. 
The final chapter “Inflation, string theory and cosmic strings”?° discusses possible 
connections between cosmic inflation and ideas from superstring theory concerning 
how inflation might be realized as part of a theory unifying all the fundamen- 
tal interactions. Inflationary cosmology as developed in the early 1980s offered an 
attractive proposal, solving many of the problems that one would face if one naively 
extrapolated a conventional matter-radiation equation of state backward in time all 
the way to the putative big bang singularity. Moreover, inflation offers a predictive 
mechanism for how the primordial perturbations from a perfectly homogeneous and 
isotropic FLRW background solution are generated. But inflation is an incomplete 
theory whose predictions are not completely defined without specifying a broader 
particle physics model within which inflation is realized. This final chapter reviews 
work on how inflation might be realized within the framework of superstring the- 
ory. These chapters serve to summarize the role of general relativity in modern 
cosmology. 

Two key issues in modern cosmology are the dark matter problem and the dark 
energy problem. Since we have not included separate chapters for these issues, we 
give a brief review here. Both problems lead to postulating a new contribution 
to the stress-energy tensor because the known components combined with gravity 
as we understand it cannot account for the observations. In general, matter and 
objects in the universe have been discovered through nongravitational means, for 
example, through the electromagnetic radiation that they emit or scatter, but the 
history of objects first discovered through their gravitational pull on other objects 
dates quite far back, to the discovery of new outer planets. 

Since Herschel’s discovery of the planet Uranus in 1781, its observed orbit persis- 
tently deviated from its predicted Newtonian trajectory according to the ephemeris 
calculations at the time. Hussey suggested in 1834 that this disagreement could be 
explained by perturbations arising from an undiscovered planet. In 1846, Le Ver- 
rier predicted the position of this new planet. On September 25, 1846, Galle and 
d’Arrest discovered this new planet, now known as Neptune, to within a degree of 
Le Verrier’s prediction. This discovery was a great triumph for Newton’s gravita- 
tional theory and was the first example where deviations of an observed orbit from 
the predicted orbit led to the discovery of missing mass.?4 
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When sufficient data from meridian and transit observations of Mercury had 
been accumulated, in 1859 Le Verrier discovered a discrepancy between the obser- 
vations and Newtonian gravity. This discrepancy may be described as the anomalous 
perihelion advance of Mercury,? at the rate of 38" per century. Using improved cal- 
culations and data sets, Newcomb in 1882 measured this discrepancy more precisely, 
obtaining 42".95 per century. A more recent value is 42".98 + 0.04 per century.?? 

In the last half of the 19th century, efforts to account for the anomalous perihe- 
lion advance of Mercury explored two general directions: (i) Searching for a puta- 
tive planet “Vulcan” or other matter inside Mercury’s orbit and (ii) postulating 
an ad hoc modified gravitational force law. Both these directions proved unsuc- 
cessful. Proposed modifications of the gravitational law included Clairaut’s force 
law (of the form A/r? + B/r*), Hall’s hypothesis (that the gravitational attrac- 
tion is proportional to the inverse of distance to the (2 + 6) power instead of the 
square) and velocity-dependent force laws. (The reader is referred Ref. 23 for an 
in depth history of the measurement and understanding of Mercury’s perihelion 
advance.) 

A compelling solution to this problem had to await the development of general 
relativity. When general relativity is taken as the correct theory for predicting cor- 
rections to Newton’s theory, we understand why when the observations reached an 
accuracy of the order of 1" per century (transit observations), a discrepancy would 


be seen. Over a century, Mercury orbits around the Sun 400 times, amounting 
to a total angle of 5 x 10° arcsec. The fractional relativistic correction (perihelion 
advance anomaly) of Mercury’s orbit is of order €GMgun/dc’, (ie. 8 x 1078) with 
€ = 3 and d being the distance of Mercury to the Sun. Therefore, the relativistic 
correction for perihelion advance is about 40 arcsec per century. As the orbit deter- 
mination of Mercury reached an accuracy of order 1078, the relativistic corrections 
to Newtonian gravity became manifest. 

We thus see how gravitational anomalies can lead either to the discovery of 
missing matter or to a modification of the fundamental theory for gravity. But 
there is also a third more mundane possibility. The secular contribution to the 
change of Moon’s orbit owing to the back reaction of lunar tides of Earth is such 
an example.’ When such orbital anomalies were found, their solution involved nei- 
ther missing mass nor modified Newtonian dynamics. Moreover, the discovery of 
Pluto might be an example of an accidental discovery resulting from the interplay 
between theory and experiment. Observations of Neptune in the late 19th century 
made astronomers suspect that there could be another planet besides Neptune per- 
turbing Uranus’ orbit. In the early 20 century, Lowell searched in vain for such a 
planet at the Lowell observatory, which he founded in Flagstaff. In 1931, Tombaugh 


“Here, “anomalous” means after the much larger corrections from perturbations of other planets 
have been subtracted. Only this residual, or anomalous part, constitutes a sign of something new. 
bThe action on the Earth is to slow down its rotation so that after some time leap seconds need 
to be inserted. 


II-8 M. Bucher and W.-T. Ni 


at Lowell observatory discovered Pluto 6° off its predicted position. However, its 
mass was much smaller than the predicted mass, so its influence on the orbits 
of Uranus and Neptune is negligible given the precision at that time. Pluto thus 
became the ninth major planet. However subsequently, as a result of a redefinition 
by the IAU of what constitutes a major planet, Pluto was downgraded to become 
reclassified as a “trans-Neptunian” object. In 1992, the next trans-Neptunian object 
was discovered and since then more than 1200 such objects have been discovered 
with Eris (discovered in 2005) more massive than Pluto. 

We now turn to dark matter, which was first introduced to account for the 
dynamics of clusters and the rotation curves of spiral galaxies, because the ordinary 
visible baryonic matter in the form of stars assuming plausible mass-to-light ratios, 
and also of the gas in the case of galaxy clusters, was unable to account for the 
observations if the correctness of ordinary Newtonian gravity is assumed. (In this 
context, corrections from special and general relativity are negligible.) The first 
hints of the need to postulate a dark matter component date back to the 1930s, 
when Zwicky found that the virial mass of galaxy clusters (i.e., the mass deduced 
by applying the virial theorem to the observed internal velocities of their member 
galaxies) greatly exceeds the total mass inferred from the luminous matter based on 
plausible mass-to-light ratios.?4 X-ray observations in the 1970s and 1980s alleviated 
this discrepancy without completely resolving it. 

The need to postulate an additional dark matter component of some sort also 
arose from studies of the dynamics of spiral galaxies. In a series of papers in the 
1970s, Rubin, Ford, and Thonnard measured the rotation curves of a number of 
disk galaxies and found that rotation speeds were larger than would be expected 
from the gravitational attraction arising from the visible mass distribution.?°~?" 
Moreover, the shape of the rotation curves inferred from the visible mass did not 
agree with the observations, which found a rotation velocity almost constant with 
varying radius. The authors interpreted their findings as evidence for a new dark 
matter component. 

Logically this conflict, known as the missing mass problem, could arise from a 
mass discrepancy, an acceleration discrepancy, or possibly even both. Many people 
believe that the missing mass is the dark matter, whose presence is made man- 
ifest only through its gravitational interaction with the visible baryonic matter. 
Others however explored the possibility that Newtonian gravitational dynamics 
should be modified. In 1982, Milgrom proposed the phenomenological modified 
Newtonian dynamics (MOND) law for small accelerations.?° Under this hypothe- 
sis, the gravitational dynamics become modified when the acceleration is smaller 
than ag ~ 107!°ms~? (Fig. 1). It was later shown possible to explain such a phe- 
nomenological Ansatz in the framework of a relativistic theory of gravity with 
additional degrees of freedom; however, none of these theories are yet satisfactory 
from a theoretical standpoint.?! 
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Fig. 1. Critical acceleration from galactic rotation curves. The ratio of dynamical to baryonic 
mass is shown at each point along rotation curves as a function of the centripetal acceleration at 
that point. The panel shows data for 74 galaxies (see Ref. 29). The presence of missing or “dark” 
matter is the conventional explanation for why in spiral galaxies the measured rotation curves 
do not agree, neither in magnitude nor in shape, with the rotation curves predicted assuming 
a reasonable mass-to-light ratio and taking into account only the contribution from the disk. 
However, if the MOND hypothesis is adopted under which the gravitational dynamics becomes 
modified when the predicted acceleration is smaller than ag(~ 107~!° ms~?), the rotation curves 
can be fit assuming a constant mass-to-light ratio and no halo dark matter component. (Note: 
Figure reprinted with permission from Ref. 30.) 


Perhaps today one of the strongest arguments for the need for a dark matter 
component arises from CMB anisotropy measurements as explained in the chapter 
on the CMB.!? The standard six-parameter model, which includes a dark matter 
component comprising ~ 24% of the critical mass, provides an adequate fit to the 
observations and it is not possible to account for the observations with a model 
having only baryonic matter. Compared to other probes, the CMB is a particularly 
clean probe of cosmological models and parameters because its interpretation is 
based on linear theory except for small and calculable nonlinear corrections. Except 
for a few degeneracies (which can be lifted by combining with a few other reliable 
ancillary data sets, such as BAO), the errors on the cosmological parameters as 
determined using the CMB are typically of order 1% and characterized by a well 
understood error budget. See Refs. 32 and 33 for a detailed discussion of the current 
status of CMB constraints. 

Another dramatic though less quantitative recent illustration of the need for 
dark matter arises from analyzing the Bullet Cluster as observed by gravitational 
lensing (which provides a clean probe of the mass) and as observed in X-rays. The 
Bullet Cluster, shown in Fig. 2,°4 is in fact the merger of two galaxy clusters. 
The X-ray image highlights the intracluster gas, which has been shocked and thus 
heated up as the result of the collision. This intracluster gas provides the dominant 
contribution to the baryonic mass, the mass from stars being subdominant. If there 
were no dark matter, the mass of the merged system would be concentrated in the 
center of the collision and would roughly coincide with the X-ray hot spot. But this 
is not what is observed in the gravitational lensing reconstruction of the projected 
mass density, which includes all types of mass whether visible or not. Instead, the 


II-10 M. Bucher and W.-T. Ni 


Fig. 2. Chandra X-ray image of the “Bullet Cluster” (red) superposed with lensing image 
(blue). In this merger seen in the X-ray, we see the hot baryonic intracluster gas that was 
shocked at the center of the collision. However in the gravitational lensing reconstruction we see 
two presumably dark matter halos comprising the majority of the mass which have been able 
to pass through each other without interaction. (Credit: NASA, see Ref. 34 for a detailed 
explanation.) (For color version, see page II-CP1.) 


lensing reconstruction shows two halos which have hardly been disrupted by the 
collision apparently simply having gone through each other, as one might expect 
from a weakly interacting dark matter component. While striking, this special sys- 
tem should not be overinterpreted, because it is difficult to render quantitative this 
qualitative interpretation. 

As the case for a dark matter component strengthened, more and more exper- 
iments to search for and ultimately characterize the dark matter have come on 
line. These experiments fall into two classes: direct and indirect. The direct dark 
matter search experiments look for the recoil of nuclei in the detector due to scat- 
tering of nuclei by dark matter particles. Backgrounds, for example from cosmic 
ray muons or from radioactive decays within or near the detector, obviously con- 
stitute a formidable obstacle. Indirect dark matter experiments search for an over- 
production (or anomaly) of particles of various kinds from dark matter decays or 
annihilations. The current status of direct dark matter searches is summarized in 
Fig. 3.°° Indirect dark matter search experiments or observations may be subject 
to conflicting interpretations. The data from the ATIC and PAMELA experiments, 
for example, of the electron and positron fluxes have been interpreted as evidence 
for dark matter annihilation, although other explanations have been put forth. See 
the Ref. 36 and references therein for a recent discussion of the methodology and 
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Fig. 3. Particle dark matter searches: Current status of constraints on WIMP dark matter from 
direct detection. Here spin-independent couplings have been assumed. (Note: Figure reprinted 
with permission from Ref. 35). (For color version, see page II-CP1.) 


current status of indirect searches. To date, no confirmed dark matter candidates 
have been found. However, these experiments have made considerable progress in 
constraining models and in demonstrating and perfecting experimental techniques. 
Search experiments for example have already excluded a large part of the cMSSM- 
preLHC model parameter space. 

The discovery of flat spiral galaxy rotation curves and the subsequent dark 
matter search takes us to the early 1990s when the favored cosmological model 
includes radiation (contributing negligibly to the mean density today), ordinary 
matter, and a weakly interacting dark matter component with a contribution such 
that the sum of the components yields a spatially flat universe. Despite the apparent 
beauty of a spatially flat universe with a vanishing cosmological constant, there 
were a few wrinkles to this story given the evidence at that time including (i) the 
inconsistency between the high measured value of the Hubble constant and the ages 
of the oldest known objects in the universe, (ii) the inconsistency between cluster 
baryon fraction and nucleosynthesis predictions, and (iii) the inconsistency with 
the value of Omatter inferred from large-scale flows. (See, for example, Refs. 37 and 
38 and references therein.) This model was simpler than the six-parameter minimal 
model presently in vogue and by the end of the 1990s, it became clear that another 
component was needed to explain the current acceleration of the universe. This 
could be either a cosmological constant or some other form of stress—energy with 
a similar equation of state (i.e., a component with a large negative pressure) that 
subsequently became known as “dark energy.” 
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In 1970s, when astrophysical and cosmological observational data accumulated, 
the improvement in the determination of the age of globular clusters,?9° the Hub- 
ble constant, and the abundance of elements led to tensions in fitting FLRW cos- 
mological models using only visible and dark matter. There were arguments that 
these measurements might already indicate a nonzero cosmological constant; how- 
ever, the evidence was not yet compelling (see, e.g. Refs. 41-43). In 1998, when the 
light curve correction to the intrinsic variability in absolute luminosities of the type 
Ia supernova had been perfected and a sufficiently large sample of such supernovae 
at intermediate redshift had been accumulated, it was shown that the expansion 
of the universe was accelerating. Acceleration of the scale factor was inconsistent 
with the minimal Qyatter = 1 cosmological model in vogue at the time and could 
be explained by a nonzero cosmological constant or other equally radical extension 
of the then accepted cosmological model. See the chapter SNe Ia as a cosmological 
probe for a detailed account of this development.*! 

A key challenge of contemporary observational cosmology is to characterize the 
nature of the dark energy. If we ignore how the dark energy responds to cosmo- 
logical perturbations, the problem can be expressed as measuring the run of the 
dimensionless parameter w = p/p through cosmic history, where p is the pressure 
and p is the density of the dark energy. If the dark energy were a cosmological 
constant, it follows that w= —1 at all times, but a host of other scenarios have 
been proposed where w is not exactly —1. Observations of the cosmic microwave 
background are ill-suited to characterizing the nature of the dark energy because 
the effect of the dark energy on the CMB anisotropies can for the most part be 
encapsulated into a single number — the angular diameter distance to the surface 
of last scatter of the CMB photons at z © 1100. What is needed is a survey of the 
geometry of the universe out to large redshifts. From cosmological observations, it 
has been inferred that our universe is very close to being spatially flat. In a FLRW 
universe, the luminosity distance as a function of redshift is given by the integral 


z 

dz(z) = (1+2)(Ho) * i dz! [Qm(1 + 2’)? + App (1 + 2’) 72 
0 

where Qpr is the present dark energy density parameter and the equation of state 

of the dark energy w is assumed to be constant. For nonconstant w or a nonflat 

FLRW universe, the expression is similar but slightly more complicated. 

To determine w(z), luminosity distances and angular diameter distances need to 
be measured at many redshifts where dark energy plays a role. These can be mea- 
sured for example by using (i) type Ia supernovae (as standard candles), (ii) baryon 
acoustic oscillations (BAO) in the matter power spectrum (as standard rulers), (iii) 
gamma ray bursts (as standards candle, although at present this method remains 
speculative) and (iv) gravitational waves (GW) from compact binaries and SMBHs 
(as standard “sirens” or candles). All these methods suffer from dispersion and bias 
due to gravitational lensing of distant objects, which can substantially alter their 
apparent luminosities and sizes. GW methods have the potential for high precision, 
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but one will likely have to wait about 20 years for a space-based GW observatory 
(e.g., eLISA**) to be put in place (see Chap. 12 for a review on GW mission proposal 
studies [45]). GW methods are also limited by gravitational lensing. However, since 
the intrinsic measurement uncertainty can be made very small (better than 0.1% 
with enough events), lensing uncertainty can be reduced by accumulating enough 
events for a detailed statistical analysis.*® 

Assuming ACDM, we can investigate whether the parameter values found using 
the CMB observations are consistent with the parameter choices indicated by com- 
pletely independent probes such as observations of type Ia supernovae and observa- 
tions of the scale of the baryonic acoustic oscillations at various redshifts, as shown 
in Fig. 4. The fact that the three ellipsoids overlap indicates that ACDM is telling 
a consistent story. 
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Fig. 4. Concordance of CMB, BAO and SNIa observations. The respective error ellipsoids are 
shown on the Qm—Q, plane. The supernova data are from the updated Union2.1 compilation of 
580 Supernovae. (Note: Figure reprinted with permission from Ref. 47.) (For color version, see 
page II-CP2.) 
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On smaller scales and at later times when nonlinear effects difficult to model are 
dominant, the success of the ACDM is less compelling. This is not because there 
is a clear contradiction between the predictions and what is observed, but rather 
because the predictions of the theory are notoriously difficult to calculate because 
of the presence of nonlinear physics and poorly understood processes such as star 
formation and galaxy formation for which much of the modeling is at present still 
largely ad hoc. It has been argued by some that alternative models of gravity are 
able to account for the observations without positing an additional dark matter 
component. For example, in Fig. 1, it is argued that MOND theory offers a better 
explanation of galactic rotation curves. In MOND proposed by Milgrom?® in 1983, 
Newtonian theory remains valid for large accelerations, but becomes modified for 
centripetal accelerations smaller than the critical acceleration ag ~ 10719 ms~?. 
Numerically, the critical acceleration is related to the cosmological constant A : 
ay ~ A!/? in natural units. Gravitational acceleration, of course, is not an invariant 
in Einstein’s theory, so in order to incorporate MOND into general relativity, some 
additional field would be required to define a preferred time direction or equivalently 
a foliation of spacetime into three-dimensional hypersurfaces. Stratified theories of 
gravity and preferred-frame theories of gravity introduce a preferred foliation. With 
a preferred foliation and extra degrees of freedom other than the physical metric, 
the theories still need to satisfy the strong equivalence principle to experimental 
precision to be viable. This is one reason why stratified theories or preferred-frame 
theories are not easy to construct in an empirically viable manner, so that they are 
consistent with experiments and observations on various noncosmological scales 
and with local experiments. Such a theory was constructed in 1973*%:4° and also 
constructed for MOND phenomenology in 2004.°! Nevertheless, both these theories 
have encountered restrictions as empirical evidence is accumulated. For example, 
pulse timing observations on the relativistic pulsar—white dwarf binary PSR J1738+ 
0333 provide stringent tests of these theories.°° However, GR is covariant, but 
cosmology is stratified in the large. In studies of the microscopic origin of gravity 
and quantum gravity, the question arises whether Lorentz invariance is fundamental 
or derived, especially in the canonical formulation.>! 

The histories of cosmology and relativity theory over the last 100 years have 
been closely intertwined. On the one hand, models of the universe rely on general 
relativity theory for part of their dynamics. On the other hand, the universe provides 
a testing ground for general relativity where Einstein’s theory can be confronted 
with observation under the most extreme conditions: on the largest length scales, 
over the longest time intervals, and at the highest energy scales, close to the putative 
big bang singularity. We close by identifying four areas where despite the existence 
of plausible hypotheses, it is likely that the last word has not been said and surprises 
may be forthcoming: (i) The dark matter acceleration discrepancy problem, (ii) the 
dark energy problem, (iii) the inflationary epoch, and (iv) a possible quantum 
gravity phase. It is likely that over the coming 100 years, part of the story told here 
may be substantially rewritten. 
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Cosmic structure 
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The history of cosmic structure goes back to the time of Einstein’s youth, although few 
scientists actually thought of the problem of galaxy and cluster formation. The data and 
ideas were collected slowly as astronomers slowly realized the nature of the problem of 
large-scale structure. This paper will review several of the key episodes in the history 
of the field. Starting with the discovery of dark matter in the 30s, the CMBR discovery 
in the 1960s to the idea of an early episode of inflation in the 1980s, the field has had 
an acceleration of discovery. In the 80s it was realized that the initial conditions of 
the universe were specified by the cold dark matter (CDM). Now initial conditions for 
the formation of structure could be specified for any type of dark matter. With the 
advent of computing resources, highly nonlinear phases of galaxy formation could be 
simulated and scientists could ask whether cold dark matter was the correct theory, 
even on the scale of dwarf spheroidal galaxies, or do the properties of the dwarfs require 
a different type of dark matter? In an idiosyncratic list, we review several of the key 
events of the history of cosmic structure, including the first measurements of €(r), then 
the remarkable success of A CDM explanations of the large-scale universe. We next turn 
to velocity fields, the large-scale flow problem, a field which was so promising 20 years 
ago, and to the baryon acoustic oscillations, a field of remarkable promise today. We 
review the problem of dwarf galaxies and Lyman-a absorption systems, asking whether 
the evidence is pointing toward a major switch in our understanding of the nature of 
dark matter. Finally, we discuss flux anomalies in multiply-lensed systems, which set 
constraints on the number of dwarf galaxies associated with the lensing galaxy, a topic 
that is now very interesting since simulations have indicated there should be hundreds of 
dwarfs orbiting the Milky Way, rather than the 10 that are known. It is quite remarkable 
that many of the today’s results are dependent on techniques first used by Einstein. 


Keywords: Cosmology; large scale structure. 
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1. History of Cosmic Discovery 


On the birthdate of general relativity, the discovery of the Hubble law,! that the 
large luminous nebulae were nearby galaxies receding from the Milky Way with 
velocity proportional to their distance, was more than 10 years in the future. That 
the galaxies were clustered was realized in the 1920’s but was first discussed by 
Shapley and Ames? upon the completion of their uniform plate survey of the 1000 
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brightest galaxies over the entire sky. The thought that the stars seen in the heavens 
were but a small fraction of the mass of the universe was not realized until 1933. 
The idea that the nature of the clustering would tell us anything about the nature 
of the dark matter was far into the future.? 

The existence of dark matter had been known for the last 70 years, although at 
first few acknowledged it. Zwicky,* after 10 years of observations and comments on 
the strange results he was finding for such nearby clusters as Coma, concluded that 
the velocity dispersion within clusters of galaxies was so high, ~ 1500 km/s, that 
they could not exist for longer than 1 billion years unless there was unseen mass 
holding them together. It was not until the 1980s that reconstruction of the mass 
profile by data from the Uhuru X-ray satellite observed the Coma cluster of galaxies 
and reported that its mass profile was consistent with Zwicky’s earlier estimates.° 
Since then X-ray observations of clusters show this thermal Bremstrallung emission 
to always be present in galaxy clusters. Now, strong lensing of background galaxies 
by rich foreground clusters unambiguously shows the presence of dark matter in 
the foreground. Gravitational lensing is another tool which shows that the clusters 
contain enormous amounts of dark matter, just like Zwicky argued 70 years ago. 
An example is shown in Fig. 1. 


Fig. 1. Gravitational lensing of Abell A2218 taken by the HST (Ref. 6). This beautiful image 
would certainly impress Einstein! The remarkable circular images of faint background galaxies can 
only be produced if there is 50 times the visible mass within the cluster, exactly as Zwicky had 
argued. (For color version, see page II-CP2.) 
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Evidence for dark matter in individual galaxies began to accumulate with 
Rubin et al.’s observation of emission line regions in the Andromeda galaxy,’ 
which they observed to have a flat rotation curve, showing that something exte- 
rior to the starlight dominates its mass distribution. The evidence that the rota- 
tion curves of many galaxies was flat could not be gathered until image tubes 
were available for spectrographs,® and with such a device they found that every 
galaxy in their study had a flat rotation curve (Fig. 2) rather than the expected 
p(r) « r—'/2. which immediately tells us the dark matter in galaxies has a 
mass profile p(r) a r~?, indicating a large amount of dark matter in the outer 
regions of galaxies. This came as a shock to the cosmological community, but 
they already suspected the galaxies and clusters to be hiding vast amounts of dark 
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Fig. 2. Rotation curves of 21 spiral galaxies, spectra taken by Rubin et al. (Ref. 8). Many of 


the galaxies show remarkably flat rotation, indicative of dark matter in the outer portions of the 
galaxy. 
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The outrageous realization that our universe is dominated by an unseen dark 
matter has led to alternative theories, chief among them the so-called MOND the- 
ory (Modified Newtonian Dynamics).2 MOND modifies Newton’s second law by 
the addition of a constant, ag, a rather draconian solution to the dark matter 
problem. But it is healthy for the field to see alternatives and to see how much of 
the cosmological phenomenology can be reproduced by a “simple” but radical mod- 
ification. Of course, this must ultimately be explained by a modified field theory.'° 
However, later work showed that MOND does not even match the linear theory 
result seen on larger scales,'! which is a fatal error, and therefore it shall not be 
covered in this paper. 


12,13 was an incred- 


The notion that our universe underwent an inflationary phase 
ible motivator of an enormous body of work to flesh out details. Finally, we had 
an answer to the question of why the universe should be approximately homoge- 
neous and isotropic, and perhaps there would eventually emerge an explanation of 
the amplitude of the fluctuations. Inflation finally gave us explanations for these 
and why the density of the universe was close to critical. It furthermore motivated 
the experimental search for the dark matter. Now, cosmologists realized that the 
large-scale structure of the universe was telling them the nature of the fluctuation 
growth at a very early time, and publications exploded. 

This paper, will be an idiosyncratic review of key aspects of the historical devel- 
opment of cosmic structure. The field is large and life is short. I apologize if I do 
not not discuss your particular field. 


2. Measurement of the Galaxy Correlation Function 
2.1. Before 1980 


Peebles!* wrote a series of 12 papers in which he applied power spectra analysis 
to all extragalactic catalogs then in existence. The utility of its transform, the 
covariance function €(r), provided a powerful and discriminating cosmological test, 
a tool to determine the veracity of a given theory. 

First, a few definitions given by Peebles which are now the standard. In a catalog 
listing only the right ascension and declination of galaxies, define solid angle dQ 
and catalog density MN. From any galaxy in the catalog, the probability that a 
neighboring cell is occupied is defined as dP = (NdQ){1 + w(8)], where w(9) is 
the angular correlation function, with 0 the angle between the two galaxies. If the 
galaxies are unclustered then w(@) = 0. This of course is sensitive to the distances 
of galaxies in the catalog. More interesting is the three-dimensional definition of 
correlations, dP = ndV[1 + €(r)], where n is the density and dV is the volume 
element and &(r) is the deviation from uniformity as a function of the distance r 
between the two galaxies. 

Of course, the galaxy correlations were presumed to be closely related to the 
underlying mass correlations, an assumption that has held up well. It was shown 
later that the galaxy fluctuations 6, can be generally represented as 6, = bdam, 
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where dam are the fluctuations in dark matter and 6b is the galaxy bias, b ~ 1.3, 
although if it is examined using a sufficiently large catalog of galaxies,'° one can see 
it to be dependent on the galaxy’s mass and color, effects which were known but 
ignored in Peebles earlier work. Peebles showed that the spatial correlation function, 
€(r), of different catalogs obeyed simple scaling laws, indicating a reproducibility 
that must be telling us something quite important about the nature of structure 
formation. An example of the angular correlation scaling Peebles emphasized is 
shown in Fig. 3, where the angular correlations of the Zwicky catalog is plotted 
against the Shane—Wirtanen correlations and the Jagellonian field correlations. The 
nonlinear correlation function has a power law slope of y = 1.8, that is €(r) = 
(r/ro)~7 with ro = 5.4 h~! Mpc. Once the catalogs are adjusted for their differing 
depths, the correlation functions are seen to match. What was the shape of the 
correlation function telling us? 


2.2. After 1980 


The first redshift survey of galaxies was published in 1982,!7 and it opened our 
eyes, giving cosmologists a new, definitive look at what the universe contained, with 
enormous filaments of galaxies, huge voids, and rich clusters at the intersections of 
the filaments. It was clear that the universe was not the result of power-law initial 
conditions throughout the scales of galaxies and clusters, as had been advocated 
prior to 1980. Power-law initial conditions were simply a holding pattern in the 
absence of any better theory, but the birth of the inflationary models!*:!8 changed 
everything. 
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Fig. 3. Left: Angular covariance function for the Jagellonian catalog (squares), Shane—Wirtanen 
catalog (circles), and Zwicky catalog (triangles). Right: Test of the scaling relationship, where the 
w values have been scaled to the Zwicky catalog and the abscissas adjusted for different depths 
(Ref. 16). 
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Until the advent of the inflationary model, it was not clear how to interpret the 
covariance function and the large-scale structure that cosmologists were beginning 
to see.!” At this time the data was clearly ahead of theory. 

Detailed calculations of the temperature fluctuations in the cosmic background 
radiation for universes dominated by massive collisionless relics of the big bang 
were just becoming available!?:?° as well as analytic approximations for the transfer 
function in the early universe, T(k). A first analytic approximation of the density 
perturbation spectrum was 


Ak 


(x1?) = 3/2 4. h2)2” 
(1+ ak + Bk3/? + yk?) 


with a = 1.7T, 6 = 9.013/2, y = 1.01? and T = (Qmh?), with OQ, the matter 
density of the universe and h = Ho/100. The natural expectation was that [ = 0.5, 
for OQ, = 1 and h = 0.7. With this power spectrum one could begin to run n-body 
codes to see if any of this made sense and could match the data.?! 

After careful calibration of the APM catalog of the full southern sky, Efstathiou 
and collaborators!® were able to measure the angular correlation function, w(@), 
over large enough scales to definitively show that [ = 0.2, and not TP = 0.5 as most 
of the cosmological community believed. Figure 4 is what convinced the community 
of the problem. Conventionally it had been assumed that the cosmological density 
parameter 9,, = 1 and that the cosmological constant A is zero. But the data 
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Fig. 4. Data points from the APM catalog (Ref. 18). The heavy black line is the fit for T = 0.2 
while the light dotted line is for T = 0.5 (Ref. 18). Note that it falls well below the data for scales 
greater than 2 degrees, thus ruling out the standard model at the time. The thin solid and dashed 
lines show the results of the linear theory for the scale invariant CDM models with h = 1 and 0.75, 
respectively. This dotted line indicates that something is wrong with the Q,, = 1 CDM model. 
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favored a low-density model, either an open cosmology or one dominated by the 
cosmological constant, with more large-scale power than in the standard Q,, = 1 
CDM model. A A cold matter model (ACDM) was immediately accepted, since an 
open cosmology, Qm, <1, Qa = 0, was even more abhorrent. 


2.3. Remarkable large-scale structure in simulations 


Measurement of the detailed large-scale galaxy correlations would have to await 
the completion of extremely large redshift surveys such as the SDSS and 2dFGRS. 
Beautiful redshift space maps from both of then, is shown in the top and left 
portions of Fig. 5. These projects were only possible with teams of astronomers 
using dedicated telescope time and specialized instrumentation, principally multiple 
fibers allowing the spectroscopic observation of hundreds of galaxies per exposure, 
as opposed to one object at a time in the first redshift surveys. Now, the total 


Fig. 5. Redshift space slices of the CfA2 stickman and the deeper walls seen with the SDSS and 
the 2dFGRS, as well as equivalent slices taken from a simulation of a ACDM cosmogony (Ref. 22). 
The galaxies are colored blue, while the simulation points are colored red. (For color version, see 
page II-CP3.) 
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surveys were several hundred thousand galaxies each. The advance of technology 
made this possible at a reasonable price and telescope time. 

Computer simulations were essential to understanding the models predicted by 
inflation. The most powerful computers available for cosmological simulations in 
the 80’s were made by Digital Equipment Corporation and featured all of 4 MB 
of memory. The earliest models?! did show that CDM models could produce the 
remarkable filaments and voids that had recently been discovered. The models were 
very small, 32° particles, but they were sufficient to make the point. As computing 
power increased, the models grew with them and cosmological simulations used a 
substantial fraction of available computing resources. The Millennium Run?? is one 
of the largest simulations of the formation of structure within the ACDM cosmogony 
so far carried out. It uses 10!° particles to follow the dark matter distribution in a 
cubic region 500h~! Mpc on a side, and has a spatial resolution of 5h~! kpc. 

Equivalent slices from the n-body simulations are shown as red points in Fig. 5. 
Note how well the Millennium simulation matches the statistics of the 2dFRGS 
(left and right). While the simulation does not have a rich cluster like Coma in the 
foreground (top), the statistics are still extremely similar in the bottom images. 
Note the very long stretch of connected filaments in the simulation, matching those 
of the observations. Pictures such as this demonstrate that ACDM fits the data 
extremely well and contains a large grain of truth. 

The observations show the presence of extremely large filaments, and the clusters 
reside at the filament intersections. Note also the elongation of galaxies pointing 
toward the observer, but these are redshift space maps. For example, in the center 
of the CfA2 “stick man,” the cluster is elongated, pointing to us. This is the Coma 
cluster, showing the high virialized velocity dispersion of the galaxies in the cluster, 
just as Zwicky observed 70 years ago. 

Now with these huge redshift surveys it is possible to measure galaxy correlations 
to extremely large scales. Furthermore, they have shown that the clustering strength 
of the galaxies is stronger for those objects of higher luminosity and red color, as 
expected in detailed models that were becoming available. 


2.4. Measurement of the BAO effect 


Since 2000, the acoustic peaks in the cosmic microwave background anisotropy 
power spectrum have clearly become one of the strongest cosmological probes.?% 
They measure the contents and curvature of the universe, demonstrate that the 
cosmic perturbations are generated early (z ~ 1100) and are dominantly adiabatic, 
and by their mere existence largely validate the simple theory used to support their 
interpretation. 

The acoustic peaks occur because the cosmological perturbations are sound 
waves in the fully ionized relativistic plasma of the early universe. The recombi- 
nation to a neutral gas at redshift z ~ 1100 abruptly decreases the sound speed 
and effectively ends the wave propagation of the CMBR and entrained baryons. 
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These fluctuations give rise to Doppler peaks of the CMBR, as well as to fluctua- 
tions known as the baryon acoustic oscillations (BAO), which have size of the first 
doppler peak. Because the universe has a significant ratio of baryons to DM, they 
will be imprinted onto the late-time power spectrum of the nonrelativistic matter. 
The baryonic perturbations continue to move after recombination, reaching a scale 
of 150 Mpc, where they begin to attract the dark matter. The resulting peak is 
perfect for performing the classical angular diameter test to redshift z ~ 1 with 
galaxy samples from the BOSS survey, or to redshifts z ~ 3 with absorption lines 
from the quasar sample of the BOSS survey.74 

For example, Fig. 6 is a first measurement of the BAO effect, from an early 
analysis of 47,000 galaxies from the SDSS. The bump seen in the correlations is 
weak but it tells a remarkable story. First, recall the definition of angular distances. 
The comoving sizes, in a universe with A and matter (Q,,,), of an object or a feature 
at redshift z in line-of-sight (rj) and transverse (r,) directions are related to the 
observed sizes Az and A@ by H(z) and Da(z): 


cAz 


ry = Ha)’ ry = (14+ 2)Da(z)A0. 


The angular distance is given by 


c * dz 
Dal = 755 | rion 


The Hubble parameter is given by 


2 1/2 
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Fig. 6. A first detection of the BAO. The plot of s?€(s) from the SDSS survey (Ref. 23). The 
magenta line is a pure CDM model without a BAO, and the other lines are different values of 
Qmh?. (For color version, see page II-CP3.) 
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Fig. 7. Plot of distance via the BAO technique versus redshift for four separate teams. A ACDM 
model, normalized by data from the Planck satellite, is a perfect fit to the data. This indicates 
that the equation of state parameter w must be close to constancy, as a cosmological constant 
model (Ref. 25). 


and w, the equation of state of the dark energy is 


Piz 
Since the measurement involves an integral of H(z), it is most informative to mea- 
sure the BAO as a function of redshift by combining the results of different surveys. 
An example is given in Fig. 7 showing that the data is exactly consistent with an 
interpretation of w = —1 as a constant, as with the cosmological constant A. The 
BAO measurements are proving to be an excellent means of constraining w(z). 
The growth of studies of the BAO is stunning to this old observer. In the CfA1 
redshift survey!” of bright (mg < 14.5) galaxies, we were at the cutting edge of 
technology and galaxies were measured one at a time. The shot noise was too large 
and the volume too small to allow &(r) to be measured for any scale r > 20 Mpc. 
Now the realistic goal of the observing teams is to measure galaxies ~ 1000 at a 
time with a target of 10° galaxies, or more, giving them the ability to measure €(r) 
reliably to scales r ~ 150 Mpc. Their motivation is to measure the scale of the BAO 
peak to better than 1%, and then to use this as a “standard ruler” for a precise 
cosmological test, setting strong constraints on possible models of dark energy, in 
particular the equation state parameter w. The measurements are going very well 
and the constraint on w is becoming tighter and tighter. This is a worthwhile goal 
as deviations from the standard DE model might be very modest. 


2.5. Further measurements of the power spectrum 


Figure 8 is a plot of the observed power spectrum of structure with wavelength 
spanning four orders of magnitude, where the solid line is ACDM. Asymptotically 
the curve varies as k! on large-scale and as k~? on small-scale. Data ranges from 
measurements of the cosmic microwave background on large-scale to Lyman-a forest 
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Fig. 8. A power spectra summary covering four orders of magnitude from a variety of cosmological 
studies, ranging from CMB measurements to constraints from Ly a@ clouds (Refs. 26 and 27). The 
agreement with the curve, a ACDM model, is remarkable. (For color version, see page II-CP4.) 


clouds on small scales. A cold dark matter (CDM) particle is nonrelativistic as 
different scales come within the horizon. Scales on the left-hand side, where the 
curve is rising, came within their horizon after the universe was dominated by 
nonrelativistic matter, that is matter dominated, while scales on the right-hand side 
of the figure came within their horizon while the universe was radiation dominated. 
If the dark matter is relativistic it will free-stream until the temperature of the 
CMB drops below its mass, and thus perturbations of that scale are erased. This is 
why the perturbations of a light particle, e.g. a particle of rest mass less than 1 keV, 
produce a cut-off in the observable range. If the dark matter is, for example, a warm 
dark matter candidate (WDM), the power spectrum on small scales, high values of 
k, will show a cut-off when the temperature of the universe was equal was equal 
to the rest mass of the particle. The precise slope of the small-scale measurements 
is sensitive to the masses of the neutrino background particle, which will cause the 
slope to bend down from the small scale asymptotic slope of —3. 

It is quite remarkable that the full spectrum of cosmic structure fits the ACDM 
model so well. A tremendous amount of work went into these tests in the last 
30 years, and they all set constraints on cosmic structure as defined by ACDM. 


2.6. Lyman-a clouds 


Observations of redshifted Lyman a forest absorption at z ~ 2—4 in the spectra of 
QSOs provide a sensitive probe of the distribution of gaseous matter in the universe, 
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Fig. 9. A simulation of the Lyman-a forest showing the typical spectrum of a high redshift quasar 
(Ref. 28). The optically thin Ly-a lines are the crossings of the line of sight through the large 
scale filaments. A representation of large scale structure of the filaments is shown as well. (For 
color version, see page II-CP4.) 


indicating it to be highly ionized. The understanding of the origin of the Lyman-a 
clouds became clear only after n-body simulations in the 1980s demonstrated the 
ubiquity of the filamentery clustering throughout the intergalactic medium in a 
CDM universe. A model of the Lyman-a forest is given in Fig. 9 where it is seen 
that the quasar line of sight pierces a filament is the site of a cloud. Prior to 
this realization the community did not have any realistic models for the forest, 
but this model appears to satisfy all the constraints. The precise measurements 
of the clustering of Lyman-a forest absorption lines provides further support for 
cosmological models based on inflation, CDM and vacuum energy, as seen in Fig. 8. 

The Lyman-a forest, due to its spectral nature, probes the matter power spec- 
trum in velocity space. But the measurement is also sensitive to the mass of the 
dark matter particle which would generate a small-scale cut-off in the linear power 
spectrum, with a cut-off that scales approximately inversely with the particle mass 
and is a result of free streaming of the dark matter. If this mass is of order 1 keV, the 
cut-off occurs on the scale of dwarf galaxies, and the dark matter particle would be 
in the class of warm dark matter (WDM).”° If the WDM is a thermal relic of mass 
1keV, the cut-off occurs for k(h/Mpc) > 10, which is slightly beyond the reach of 
the power spectrum plotted in Fig. 8. However, a further probe is to measure the 
individual line shapes of Lyman-a clouds, which will be broadened by the ther- 
mal velocity, and this can be observed with ground-based spectrographs on large 
telescopes. 

By requiring that there be enough small-scale power in the linear power spec- 
trum to reproduce the observed properties of the Lyman-a forest in quasar spec- 
tra, Narayanan et al.?° arrive at a lower limit to the mass of the WDM particle of 
0.75 keV. They argue that any model that suppresses the CDM linear power spectra 
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more severely than a 0.75 keV particle cannot reproduce the Lyman-a forest. Recent 
progress in this field is reviewed by Viel?® who cites upper limits of 4.5 keV. 

Improving these observations is important because a cut-off in small-scale power 
of the dark matter would corroborate the possible cut-off as indicated by dwarf 
galaxies, discussed below. 


3. Large Scale Flows 


Soon after the discovery of the CMBR, the search began for large-scale inhomo- 
geneities in the CMB, and the dipole anisotropy was detected in 1976 in competi- 
tion between groups at Princeton and Berkeley.*!°? With the advent of the COBE 
satellite in 1989 we finally measured the first ~ 20 spherical harmonics, including 
a beautiful picture of the dipole anisotropy. The Milky Way and local group of 
galaxies are moving at 640 km/s, 40° away from the Virgo local supercluster, which 
had been thought to be the attractive center. The question immediately raised by 
this observation was what was responsible for our motion? Was the flow generated 
gravitationally, or does it result from some nongravitational effect? Expectation 
of the CDM models was that the dipole anisotropy should be generated by local 
perturbations, with essentially nothing generated from great distances, z > 0.15. 
This would be the same type of motion which is apparent in n-body models, where 
galaxies and large-scale structure are moving together, simply expressing the local 
gravitational field. 

Because the gravity field is the vector sum of all the perturbations on every 
scale, it makes no sense to break down our motion into components generated 
by named clusters structures, since they do not dominate our gravitational accel- 
eration. Rather, the gravity is controlled by the total mass distribution which is 
dominated by the voids and filaments. About all that can be done is to plot the 
net acceleration of the local group as a function of distance. 

It was recognized that the measurement of large-scale flow could serve as a 
check on 2, as it is a measure of the gravity on scales even larger than clusters, 
~ 50 Mpc. One presumes that the galaxy distribution, multiplied by a bias factor 
“b.” is a substitute for the large-scale mass field. It need not be precise, merely 
tracing the rough outline of the mass distribution. 

Because of the long decay time from the time, z = 1100, when the gravity 
and velocity fields were released from the radiation, any random velocity field has 
decayed, eliminating rotational and other decaying modes. The velocity field present 
now has an especially simple relationship with the responsible gravitational field. 
Those components of the velocity field which are not coherent with the density 
fluctuations have decayed as the universe expands, and so at late times one expects 
the velocity field to be aligned with the gravity field, at least in the limit of small 
amplitude fluctuations. In the linear regime, this relation implies a simple propor- 
tionality between the gravity field g and the velocity field v,, namely vg x gt where 
the only possible time t is the Hubble time. The exact expression depends on the 
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mean cosmological mass density Q,, and is given by*4 
2f(Qm) 
ve(r) = 3 Hon, 20” 
where f(Q) is the linear growth rate. Given complete knowledge of the mass fluc- 


tuation field, 6,(r), over all space, the gravity field g(r) is 


r’—r 


gle) = Gp [ a°x!6,(0) 


where f is the mean mass density of the universe. Replacing the integral over space 
with a sum over the galaxies in a eae we have 
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where 7 is the true mean galaxy density in the sample, 3 = f(Q)/b with f » °° 


the linear growth factor with A included, and with radial selection function ¢(r).* 

The work is greatly aided by supplementing it with secondary distance indicators 
of as many galaxies as possible. These secondary distance indicators can be for 
example, galaxies with measured magnitudes and the amplitude of its asymptotic 
rotation curve, which is sufficient to predict a galaxy’s distance to approximately 
20%.°° Of course with the relatively poor distance indicator, it will take many 
galaxies to produce a meaningful signal, but the samples now include approximately 
5000 galaxies drawn from the full sky. 

The techniques for finding the best flow-field have undergone considerable evo- 
lution, a recent flow-field is show in Fig. 10.°° The velocity field is determined 
from the galaxies having secondary distance indicators by a technique known as 
the inverse Tully—Fisher method, to which approximately 40 orthogonal modes are 
fit. The gravity field is determined by a full-scale sample of 45,000 galaxies in the 
2MRS catalog, which is useful to a distance of z < 0.04. The figures all show the flow 
as seen from the local group frame, but in order to see the deviations beyond the 
dipole, 400 km/s of dipole anisotropy is subtracted from the LG motion of all the 
figures. The top figure is the distribution of galaxies less than cz < 2000 km/s where 
the red is outflowing from our perspective and the blue is inflowing, with the size of 
the symbol being proportional to the velocity. The second row is for galaxies with 
2000 < cz < 4000 km/s, the third row is for galaxies with 4000 < cz < 6000km/s, 
and the fourth row is for galaxies with 6000 < cz < 10000km/s. 

The figures on the right are the gravity field computed from the full sky distri- 
bution of galaxies, with the same change of coordinates as for the figures on the 
left. This gravity field was generated with 0,, = 0.3. Note that the velocity field 
is not a simple dipole, as the local velocity is responding to all the multipoles of 
the local gravity. Note also the quite amazing agreement between the curves on the 


4¢(r) is defined as the fraction of the luminosity distribution function observable at distance r for 
a given flux limit. 
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Fig. 10. The velocity field of local galaxies (left) and the gravity field (right) derived from the 
2MRS sample (Ref. 33). The stars are outflowing from the local group frame of reference and the 
circles are inflowing. The size of the symbol is proportional to the velocity, with a caption of sizes 
drawn at the bottom of the figure. To better see the fields, a 400 km/s dipole has been subtracted 
from each plot. The four rows divide the samples by distance. It is important to note that the two 
fields were independently derived and show essentially perfect agreement, just as linear theory 
predicts. 


left with those on the right. This tells us that for the most part linear perturbation 
theory does describe the motion of galaxies in the universe, a reassuring check. 

It is apparent that the source of the bulk of the 640 km/s dipole pattern seen 
in the local CMBR has 500 km/s due to the galaxy distribution up to a distance 
of 100h~! Mpc, with the rest generated by larger scale structure, as expected in 
ACDM models. There is no need for exotic explanations of our dipole velocity field. 
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The agreement between the velocity field and the gravitational field is particu- 
larly reassuring for our understanding of cosmological theory. Such a conclusion is a 
comfort for linear perturbation theory in an expanding universe and was certainly 
expected, because the effective smoothing of the velocity and gravity fields has left 
only linear perturbations, which appears to be adequate for the large scales tested 
by this method. We see no evidence that the dark matter does not follow the galaxy 
distribution, and it is consistent with constant bias on large scales. There is no evi- 
dence for a nonlinear bias in the local flows. A smooth, unclustered component to 
the dark matter in the universe can only be tested by the agreement between (@ 
measured in the large-scale flows and 2,, measured by the CMBR. To the accuracy 
of the measurement, the two do agree. Furthermore, the velocity—gravity compar- 
ison measures the acceleration on scales up to 30-50 Mpc and since we derived a 
similar value of 3 as for clusters of galaxies, we conclude that dark matter appears 
to fully participate in the clustering on scales of a few megaparsec and larger. 

It is interesting that the counts of galaxies give the best possible gravity field, 
reinforcing the old idea that the mass of the halo around a galaxy is not very well 
correlated with the luminosity of that galaxy. We find no evidence for large-scale 
flows beyond those indicated by the gravity field. Note that our analysis has not 
used the CMBR dipole, but at the end we see a velocity field that is consistent with 
it. We see no evidence that the dipole in the CMBR is produced by anything other 
than our motion in the universe. 


4. Dwarf Galaxies as a Probe of Dark Matter 


Astronomers did not realize that there was a missing satellite problem for the 
Milky Way until large simulations showed an enormous number of substructures 
to every large galaxy. Simulated galaxies retain a large amount of substructure 
formed by earlier collapses on smaller scales, predicting hundreds or thousands 
of subhalos in contrast to the ~ 10 dwarf spheroidal satellites of the Milky Way. 
N-body simulations of the formation of dark matter halos robustly predict that 
approximately 10% of the mass of any galactic dark halo should be in the form of 
sub-halos with numbers that rise steeply toward lower masses. 

For the past 10 years, astronomers have been aware of this disagreement between 
observed dwarf spheroidal galaxies and their numerical simulations of normal galax- 
ies.26 Boylin-Kolchin et al.3” used the aquarius simulations to show that the most 
massive subhaloes in detailed ACDM galaxy-mass simulations of dark matter haloes 
are grossly inconsistent with the dynamics of the brightest Milky Way dwarf 
spheroidal galaxies. The best-fitting hosts of the MW dwarf spheroidals in all CDM 
simulations predict at least 10 haloes with rotation curves double that amount 
observed in MW dwarfs. Another question is what has happened to the missing 
90% of condensed halo’s with the mass of MW dwarfs? 

The dwarf spheroidals have lower dispersion and therefore lower central density 
or lower mass than predicted for the most massive satellites?” as shown in Fig. 11. 
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Fig. 11. Observed Voipe values of the nine bright, Ly > 10°Lo, dwarf spheroidals in the MW 
along with rotation curves corresponding to dwarfs with Vmax = (12, 18, 24,40) km/s (Ref. 37). 
The absence of any dwarfs in the Milky Way with potential of ~ 40km/s is a serious problem for 
the CDM cosmogeny as there are far more than 10 dwarfs in the simulations with potential this 
deep. 


These dwarfs, and the absence of the abundant sub-halos, do not seem to match 
the expectations of ACDM, and this is the first prediction of the theory that fails 
to match the observations. However, the dark matter predictions hold only if the 
velocity dispersion of the dark matter particle is very low, which would be the case 
if the dark matter mass is sufficiently high, = 100 keV. We know so little about the 
nature of the dark matter. Is this a clue to its mass or its history? All we know is 
that the DM is collisionless and has a rest mass of at least 1 keV, with guesses that 
the DM is the supersymmetric partner to an ordinary particle with a mass in the 
range 10-100 GeV. After 30 years of effort, we still have no idea if dark matter is 
a supersymmetric particle with mass greater than a baryon. 

The missing satellite problem seems like it could be solved fairly easily by bary- 
onic physics, which is not included in the simulations above. For example, the 
velocity threshold at which subhalo satellites and dwarf satellite counts diverge is 
close to 30 km/s, a value that is reached by the heating of the ultraviolet photoion- 
izing background which suppresses gas accretion in small halos with low velocity 
dispersion. Perhaps these halos are dark matter only, and the baryons are simply 
too hot to condense? 
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Even if the baryons condense, the difficulty of understanding galaxies is complex 
because the gas will form stars and supernovae, which can reheat the gas, causing 
it to outflow, possibly changing the orbits of the dark matter particles so as to 
reduce the DM central density.2* Whether this is possible for the very low mass, 
~ 10’ Me dwarfs, is not clear. Analytical models*® suggest that with so few stars 
there is not enough energy in supernovae alone to create sufficiently large dark 
matter cores. Alternatively, supernova and stellar winds from the first generation 
of stars could drive remaining gas out of the shallow potential wells of these low 
mass halos making them impossible to see. 

But it has been known for some time that physics including more than gravi- 
tation was essential to the evolution of the baryons if the simulations are to make 
galaxies with observed properties. The addition of supernova feedback and hot 
winds from early type stars, in addition to heating by AGNs, reduces the density 
of baryons and DM in the galactic center and are now realized as critical in galaxy 
formation.*° Thus, it is necessary to understand these additional heating mecha- 
nisms very well before concluding that there is a problem with standard CDM. The 
situation is at present controversial, with different simulators producing different 
results.*4 

Dark matter halos offer a unique method of determining the mass of DM. The 
transfer function of fluctuations on galaxy sizes varies as k~° (k is the wave num- 
ber), provided the DM particle is collisionless and effectively at rest, with small 
free streaming length compared to the size of the fluctuation. WDM is a simple 
modification of CDM in which the dark matter particles have initial velocities due 
either to their having decoupled as thermal relics or to their having been formed 
via nonequilibrium decay. The free streaming length of the DM, the distance a dark 
matter particle will travel in the early universe, erases structure of that mass and 
smaller. An important check on the rest mass of the particle is the smallest astro- 
nomical structure that is dominated by dark matter. If the rest mass of the dark 
matter is sufficiently high then the dark matter has ever more negligible peculiar 
velocities, but for rest masses of order 5 keV the particles today have a primor- 
dial velocity dispersion of approximately 20 km/s, and this is sufficient to play 
an important role in the formation of dwarf galaxies. The structure of galaxies in 
CDM is self-similar because the free streaming length is extremely small so there is 
no scale defined by the collapsing structure. In WDM, the smallest galaxies would 
not be self-similar, but their DM would present a scale, the free streaming scale, 
of order the drift distance a particle would travel in the age of the universe. The 
WDM cosmogony thus erases sufficiently small perturbations. CDM cannot pick a 
scale at the present epoch and any DM particle of rest mass greater than ~ 100 
keV therefore is a CDM candidate. Could it be that WDM erased the small-scale 
perturbations? 

No CDM-based model of the satellite population of the Milky Way explains 
this result. The problem lies in the densities: The number density of galaxies in the 
model is typically set by matching halos to the galaxy counts in the sky and in so 
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doing requires the dwarf spheroidals to have DM haloes that are a factor of 5 more 
massive than is observed. Thus, the problem of the shallow potential wells remains. 

Rather than erasing small-scale perturbations, as in WDM models, another 
method of smoothing the core of dwarf galaxies is to have the dark matter be self- 
interacting (SIDM).*? This model has been discussed several times in the literature 
starting with Spergel and Steinhardt.*? If the dark matter scatters it will form a 
core in the center of galaxies, making the potential well shallower, reducing the 
velocities of the simulation. The central density is reduced as shown in Fig. 12, 
bringing the simulations into closer agreement with the data. But the dark matter 
would need a strongly interacting cross-section which seems unlikely. Rocha et al.*? 
argue that SIDM with o/m ~ 0.1 cm?/g appears capable of reproducing reported 
core sizes and central densities of dwarfs. (1 em?/g = 1 barn/GeV, a nuclear-scale 
cross-section.) They have also shown that self-interaction of that strength does not 
force the galaxies to become spherical,*4 
SIDM models. 

SIDM models usually have assumed the simplest scattering cross-section. The 
particles are like billiard balls with no velocity dependence in their cross-section. 


In the absence of a theory of the dark matter, it is best to keep the simulations 
45,46 


which previously was an argument against 


simplest. However, a few theorists are braver, who use a nonpower-law velocity 
dependence of the cross-section which is motivated by a Yukawa-like term. As in 
the other SIDM models, they find the density profile of dwarfs is unchanged except 
for the constant density core. These models offer an interesting alternative to the 
standard cold dark matter models that can reduce the discrepancy between the 
brightest Milky Way satellites and the dense subhaloes found in CDM simulations. 

The understanding of the nature of the dwarf spheroidal galaxies is clearly 


a point of tension with the CDM models, a problem which must be solved, either 
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Fig. 12. Comparison of the density profiles of a CDM and SIDM halo, showing the core produced 


by elastic scattering. This halo has mass M = 4 x 10!8 Mo, although it is not sensitive to the 
mass (Ref. 42). 
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through astrophysics or particle physics. Perhaps some clarification will emerge from 
detailed high-resolution study of Lyman-a clouds, or from a realistic assessment 
of the effects of the baryonic component? It is quite remarkable that the dwarf 
spheroidals might actually set an extremely important particle physics parameter. 


5. Gravitational Lensing 


Einstein knew all about gravitational lensing, and it is appropriate to celebrate the 
100th year of GR with a section detailing how gravitational lensing is used today. 
This section shall discuss three separate projects where gravitational lensing was 
essential. This is not a complete list but represents the first projects I would desire 
to discuss with Einstein. Reviews of lensing are given by Blandford and Narayan.*” 


5.1. Double images 


In 1979, the astronomical community was greatly surprised when Walsh et al. 
announced the discovery of a gravitational lens.4* Two images of a quasar at z = 1.4, 
separated by 5.7 arc seconds, were the first documented case of a gravitational lens. 
The lens was a cluster of galaxies plus an elliptical galaxy in a cluster at an interme- 
diate redshift. The separation of the lens is twice that expected for most galaxies 
and the quasars are off-center from the galaxy, suggesting the galaxy cluster is 
partly responsible for the lensing. Only a few scientists realized the power of grav- 
itational lensing but soon the field exploded. An example of the spectacular use of 
lensing is that of Markevitch,*® see Fig. 13. Now gravitational lensing has become 
a wonderful, unexpected tool. 


5.2. Bullet cluster 


The bullet cluster is an example of two clusters that have slammed through each 
other and in another ~ 10° years will merge into one, a process that is completely 
normal in the growth of structure, as shown in the beautiful n-body movies. The 
baryons emitting the X-rays are colored red in Fig. 13, while the weak gravitational- 
lensing results showing the existence of dark matter is colored blue. Galaxies are 
mixed with the dark matter, as expected since their cross-section is small, but the 
collisional hot gas shows a bow shock created when the two clouds collided. The 
discovery of this remarkable “bullet cluster”? has shown quite clearly that the dark 
matter is collisionless while the hot gas behaves like normal baryons. This has 
emerged as a very illustrative example that is a favorite to discuss in class with 
students. The problems with the speed of the merging clusters strikes me as a 
minor issue. A MOND explanation for the bullet cluster is a bit far-fetched. 


5.3. Substructure of gravitational lenses 


Another way to search for the missing dark matter substructure is to monitor 
the flux of multiply-imaged quasars behind foreground galaxies. If the galaxy is 
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Fig. 13. Chandra X-ray image of the “bullet cluster” (red) superposed with lensing image of two 
clusters (blue) (Ref. 49). (For color version, see page II-CP1.) 


a smooth potential, then the relative brightness and positions of the quasars can 
be predicted, but if there are multiple dwarf galaxies within the foreground, then 
the microscopic details of the quasars will be anomalous. Kochanek and Dalal®° 
detail results of seven cases and conclude that lowmass subhalos remain the best 
explanation of the galaxy’s potential. They conclude that a large fraction (~1%- 
2%) of the projected mass at the Einstein radius, ~5kpc, must be in the form 
of local substructure. The number of anomalies is far higher than the 10 dwarf 
ellipticals in the Milky Way. When this result was first announced, the astronomical 
community was skeptical, but Dalal and Kochanek have brought together data from 
the radio and the optical which eliminates foreground dust as the explanation. 
This is strong evidence in favor of the ACDM model but the uncertainties of the 
measurement are quite large because of the small sample size. Analysis of the SDSS 
sample will find more multiply-lensed objects, but they will have small angular size, 
making them difficult for detailed followup. 

The multiply-lensed quasars are giving results that appear consistent with the 
standard model but the number of suitable candidates are insufficient to give details. 
A new technique for detecting dwarfs in other galaxies involves ALMA imaging of 
galaxies that are strongly lensing background galaxies containing dust.>! The lensed 
dusty galaxies can be found in the data from the CMB experiments at the South 
Pole. Most galaxies have dust in their nuclear regions of size ~ 1 kpc, corresponding 
to an angular scale of 0.2, a size large enough to prevent lensing by a single star 
in the foreground galaxy. But it will be lensed by dwarf galaxies of order 10° Ma, 
which have an Einstein radius of the same angular extent. The dusty nuclear regions 
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typically show strong velocity gradients, some indicating fast-rotating disks and 
others indicating major mergers. The spatially resolved spectroscopy of ALMA thus 
will permit the enhancement of the spatial resolution. Simulations of the procedure 
appear promising, and they estimate as many as one dwarf object per target. This 
is a brilliant technique for utilizing the magnification of gravitational lenses. 


6. Conclusion 


It is remarkable how developed the ACDM story has become in a relatively short 
time, a beautiful tale that we would love to share with Einstein. It is a new spin 
on his “greatest blunder.” And Einstein would certainly love the uses to which 
gravitational lensing, his tool of 100 years ago, has been applied. The field has 
developed in the last century in ways that were unimaginable while Einstein was 
alive. 

The search for the dark matter is still outstanding after 30 years of effort, but 
hopefully it will soon be discovered either in an underground experiment or at 
CERN’s LHC. The inability to detect the dark matter as regions of phase space are 
steadily excluded will become a problem for the WIMP explanation of dark matter 
unless the next generation of experiments has a positive detection. 

The possible hints of problems with the ACDM model on the scale of dwarfs are 
extremely puzzling and will hopefully be resolved in a few years. Detecting fainter 
dwarfs is certainly worthwhile. The prospect of using ALMA to observe strong 
galactic lenses that are magnifying background dusty galaxies is an excellent idea 
and we hope the experiment works. Or perhaps the dwarf data is telling us that 
the notion of self-interacting dark matter should be taken seriously? 

The next few years will, with the new telescopes under development and further 
study of the remarkable recent advances in the inflationary early universe, lead to 
new, tighter cosmological constraints that will have implications for particle physics 
that I cannot imagine today. 
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Observations of the cosmic microwave background (CMB), especially of its frequency 
spectrum and its anisotropies, both in temperature and in polarization, have played a 
key role in the development of modern cosmology and of our understanding of the very 
early universe. We review the underlying physics of the CMB and how the primordial 
temperature and polarization anisotropies were imprinted. Possibilities for distinguishing 
competing cosmological models are emphasized. The current status of CMB experiments 
and experimental techniques with an emphasis toward future observations, particularly 
in polarization, is reviewed. The physics of foreground emissions, especially of polarized 
dust, is discussed in detail, since this area is likely to become crucial for measurements 
of the B modes of the CMB polarization at ever greater sensitivity. 


1. Observing the Microwave Sky: A Short History and 
Observational Overview 


In their 1965 landmark paper A. Penzias and R. Wilson,!°? who were investigating 
the origin of radio interference at what at the time were considered high frequencies, 
reported a 3.5K signal from the sky at 4GHz that was “isotropic, unpolarized, 
and free from seasonal variations” within the limits of their observations. Their 
apparatus was a 20 foot horn directed at the zenith coupled to a maser amplifier 
and a radiometer (see Fig. 1). The maser amplifier and radiometer were switched 
between the sky and a liquid helium cooled reference source used for comparison. 
Alternative explanations such as ground pick-up from the sidelobes of their antenna 
were ruled out in their analysis, and they noted that known radio sources would 
contribute negligibly at this frequency because their apparent temperature falls 
rapidly with frequency. 
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Fig. 1. Horn antenna used in 1964 by Penzias and Wilson to discover the CMB. (Credit: NASA 
image) 


In a companion paper published in the same issue of the Astrophysical Journal, 
Dicke et al.4° proposed the explanation that the isotropic sky signal seen by Penzias 
and Wilson was in fact emanating from a hot big bang, as had been suggested in 
the 1948 paper of Alpher et al.° suggesting the presence of the photon blackbody 
component having a temperature of approximately a few K. Their prediction was 
based on considering the conditions required for successful nucleosynthesis in an 
expanding universe — that is, to create an appreciable fraction of primordial helium 
from the neutrons that are decaying as the universe is expanding. 

The importance of this discovery became almost immediately apparent, and 
others set out to better characterize this excess emission, which later would become 
known as the Cosmic Microwave Background (CMB), or sometimes in the older 
literature the Cosmic Microwave Background Radiation (CMBR). The two principal 
questions were: (i) To what extent is this background isotropic? (ii) How close is 
the spectrum to a perfect Planckian blackbody spectrum? 

The main obstacle to answering these questions was the lack of adequate instru- 
mentation, and this was the main reason why the first detection of the CMB 
anisotropy had to wait until 1992, when the COBE team announced their observa- 
tion of a statistically significant anisotropy of primordial origin after the dipole due 
to our motion with respect to the CMB had been subtracted? (see Figs. 2 and 3). 
The COBE satellite, in a low-Earth orbit, carried three instruments: the differen- 
tial microwave radiometer (DMR), the far infrared absolute spectrophotometer 
(FIRAS),!°° and the diffuse infrared background experiment (DIRBE). By today’s 
standards, the measurement of the cosmic microwave background anisotropy was 
crude. The angular resolution was low — the width of the beam was 7° (FWHM), 


Physics of the cosmic microwave background anisotropy Tl-45 


AT = 3,353 mK 


AT = 18 pK 


Fig. 2. The microwave sky as seen by the COBE DMR (differential microwave radiometer) instru- 
ment. The top panel shows the microwave sky as seen on a linear temperature scale including zero. 
No anisotropies are visible in this image, because the CMB monopole at 2.725 K dominates. In 
the middle panel, the monopole component has been subtracted. Apart from some slight con- 
tamination from the galaxy near the equator (corresponding to the plane of our Galaxy), one 
sees only a nearly perfect dipole pattern, owing to our peculiar motion with respect to the rest 
frame defined by the CMB. In the bottom panel, both the monopole and dipole components have 
been removed. Except for the galactic contamination around the equator, one sees the cosmic 
microwave background anisotropy along with some noise. (Credit: NASA/COBE Science Team) 
(For color version, see page II-CP5.) 


and the sky map used for the analysis was smoothed to 10° to suppress beam 
artefacts. The contribution from instrument noise was large by contemporary stan- 
dards. The COBE noise was 43, 16, and 22 mK Hz~!/? for the 31, 53, and 90 GHz 
channels, respectively.?!:? Nevertheless, COBE did provide a convincing first detec- 
tion of the CMB anisotropy, and most importantly established the overall level of 
the primordial cosmological density perturbations, which played a crucial role in 
determining the viability of the cosmological models in vogue at that time, which 
were much more numerous and varied than today. 


@By comparison, for WMAP the detector sensitivities were 0.7, 0.7,0.9, 1.1, and 1.5mK-Hz~!/2 
for the K, Ka, Q, V, and W bands, respectively.98 
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Fig. 3. COBE DMR individual frequency maps (with monopole and dipole components removed). 
Taking data at several frequencies is key to proving the primordial origin of the signal and removing 
galactic contaminants. Here are the three frequency maps from the COBE DMR observations. 
(Credit: NASA/COBE Science Team) (For color version, see page II-CP5.) 


The COBE detection was followed by an intense effort to characterize the CMB 
anisotropy at greater sensitivity and on smaller angular scales. There were numerous 
experiments from the ground at locations where the column density of water in the 
atmosphere is particularly low, such as Saskatoon, the Atacama Desert in Chile, the 
South Pole, and the Canary Islands, as well as from stratospheric balloons. Figure 4 
shows the state of play about four years after COBE, with two competing theoretical 
models plotted together with the data points available at that time. In Sec. 6 we 
shall present the physics of the CMB systematically, but jumping ahead a bit, we 
give here a few words of explanation for understanding this plot. In most theoretical 
models, the CMB is generated by an isotropic Gaussian stochastic process, in which 
case all the available information concerning the underlying theoretical model can 
be extracted by measuring the angular power spectrum of the CMB anisotropies. 
Because of isotropy, one may expand the map in spherical harmonics to extract its 
angular power spectrum, defined as 
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Fig. 4. State of CMB observations in 1998. After the COBE DMR detection at very large angular 
scales at the so-called Sachs—Wolfe plateau, numerous groups sought to discover the acoustic peaks, 
or to find their absence. This compilation from 1998 shows the state of play at that time. While an 
unmistakable rise toward the first acoustic peak is apparent, it is not so apparent what happens 
toward higher @. Two theories broadly compatible with the data at that time are plotted, one 
with Q;, = 0 and another with 2; * 0.7, which predicts a higher amplitude of the primordial 
perturbations and an acoustic peak shifted to smaller angular scales, as explained in Sec. 7.3. 
(Credit: Max Tegmark) (For color version, see page II-CP5.) 


It is customary to plot the quantity @(¢ + 1)Ce/(27), which would be constant for 
a scale invariant pattern on the sky.> This is the quantity plotted in Fig. 4, where 
the question posed at the time was whether there is a rise to a first acoustic peak 
followed by several decaying secondary acoustic peaks, as indicated by the solid 
theoretical curves. This structure is a prediction of simple inflationary models — 
or more precisely, of models where only the adiabatic growing mode is excited with 
an approximately scale invariant primordial spectrum. In this plot, one sees fairly 
convincing evidence for a rise in the angular power, but the continuation of the 
curve is unclear. 

Several experiments contributed to providing the first clear detection of 
the acoustic oscillations, namely TOCO,'*4 MAXIMA,” and Boomerang.!°:140 


bStrictly speaking, scale invariance on the sky can be defined only asymptotically in the £— oo 
limit, because the curvature of the celestial sphere breaks scale invariance. The use of ¢(¢ + 1) 
rather than ¢? here is an historical convention. 
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Fig.5. Boomerang observation of the acoustic oscillations. The Ge power spectrum as measured 
by Boomerang is shown on the left without a fit to a theoretical model and on the right with 
the theoretical predictions for a spatially flat cosmological model with an exactly scale invariant 
primordial power spectrum for the adiabatic growing mode. (Credit: Boomerang Collaboration) 


Figure 5 shows the power spectrum as measured by one of these experiments, 
namely Boomerang, where a series of well defined acoustic oscillations is clearly 
visible. 

In the meantime, parallel efforts were underway in Europe and in the United 
States to prepare for another CMB space mission to follow on COBE at higher 
sensitivity and angular resolution. The COBE beam, which was 7° FWHM, did not 
use a telescope but rather microwave feed horns pointed directly at the sky. While 
not allowing for a high angular resolution, the feed horns had the advantage of pro- 
ducing well defined beams with rapidly falling sidelobes. The US NASA WMAP 
satellite, launched in 2001, delivered its one-year data release in 2003 (including TT 
and TE),?!6234 and its first polarization data (including also EE) in 200678° 288 
(see Figs. 6 and 7). WMAP continued taking data for nine years, and released 
installments of papers based on the five-, seven-, and nine-year data releases, in 
which the results were further refined benefitting from a longer integration time 
(which nominally would shrink error bars in proportion to 1/./fops) as well as 
improved instrument modeling.?°9~75! WMAP used horns pointed at a 1.4mx1.6m 
off-axis Gregorian mirror to obtain diffraction limited beams (although the mir- 
ror was under-illuminated somewhat to reduce far sidelobes). The 20 detectors 
were based on coherent amplification using HEMTs (high electron mobility tran- 
sistors) — the state of the art in coherent low-noise amplification at the time. The 
coherent detection technology used by WMAP had the advantage that the elec- 
tronics could be passively cooled. The competing incoherent bolometric detection 
technology permits superior sensitivity, which operates almost at the quantum noise 
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Fig. 6. WMAP single temperature frequency maps. WMAP observed in five frequency bands. 
(Credit: NASA/WMAP Science Team) (For color version, see page II-CP6.) 


limit of the incident photons, but requires cooling to approximately 100mK. The 
cryogenic system required was judged to present substantial risk for a space mission 
at the time, and this was likely one of the reasons why WMAP was selected over one 
of its competitor missions, which was similar to the European Planck HFI proposal. 

The successor to WMAP was the European Space Agency (ESA) Planck 
satellite, which was launched in May 2009.!°° '8! Planck consisted of two 
instruments: (i) a low-frequency instrument with three channels (at 30,44, and 
70 GHz) using a coherent detection technology, and (ii) a high-frequency instru- 
ment using cryogenically-cooled bolometric detectors observing in six bands 
(100, 143, 217, 353,545, and 857 GHz. All bands except the highest two bands were 
polarization sensitive and had a diffraction limited angular resolution, starting at 
33 arcmin for the 30 GHz channel and going down to 5.5 arcmin for the 217 GHz 
channel. (The top three channels have an angular resolution of approximately 5’.) 
Planck HFI took data from August 2009 to January 2012, when the coolant for 
its high-frequency instrument ran out. The Planck Collaboration reported its first 
results for cosmology in March 2013 based on its temperature anisotropy data. 
The first results for cosmology using the polarization data collected by Planck were 
delivered in February 2015. 
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Fig. 7. WMAP single frequency polarization maps. The lines (which may be thought of as double- 
headed vectors) show the amplitude and orientation of the linear polarization of the measured 
CMB anisotropy in the indicated bands. The amplitude P = \/Q? + U? is also shown using the 
indicated color scale. (Credit: NASA/WMAP Science Team) (For color version, see page II-CP7.) 


Figure 8 shows the Planck full-sky maps of the intensity of the microwave emis- 
sion in nine frequency bands ranging from 30 GHz to 857 GHz. Figure 9 shows a 
cleaned full-sky map where a linear combination of the single band maps has been 
taken in order to isolate the primordial cosmic microwave background signal. While 
the fluctuations in the cleaned map in Fig. 9 do not appear to single out any partic- 
ular direction in the sky and appear consistent with an isotropic Gaussian random 
process, the maps in Fig. 8 show a clear excess in the galactic plane. These full-sky 
maps use a Molleweide projection in galactic coordinates, with the galactic center 
at the center of the projection. Even though the galactic contamination depends 
largely on the angle to the galactic plane, with a lesser tendency to increase toward 
the galactic center, considerable structure can be seen in the galactic emission over 
a broad range of angular scales. The central bands include the least amount of 
galactic contamination, which visibly is much larger for the lowest and the highest 
frequencies shown. 

The CMB dipole amplitude is AT = 3.365 + 0.027mK and directed toward 
(1, b) = (264.4° + 0.3°, 48.4 + 0.5°) in galactic coordinates.!°4 The maps have been 
processed to remove the 2.725 K CMB monopole component as well as the smaller 
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Fig. 8. Planck single frequency temperature maps. The ESA Planck satellite surveyed the sky 
in nine broad [i.e., (Av)/v & 0.3] frequency bands, centered at 30, 44, 70, 100, 143, 217, 353, 545, 
and 857 GHz, shown in galactic coordinates. The units are CMB thermodynamic temperature 
[see Eq. (3)]. The nonlinear scale avoids saturation in regions of high galactic emission. (Credit: 
ESA/Planck Collaboration) (For color version, see page II-CP8.) 


CMB dipole having a peak-to-peak amplitude of approximately 6.73mK, so that 
only the spherical harmonic multipoles with @ > 2 are included. CMB angular 
power is typically expressed in terms of Dy = ¢(€ + 1)C;/2z, which in the flat sky 
approximation corresponds to rms power (in K?) per unit logarithmic interval 
in the spatial frequency. A scale invariant temperature spectrum on the celestial 
sphere would correspond to ¢?C,¢ being constant. For comparison we give here a 
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Fig. 9. Planck CMB map obtained from combining the single-frequency maps. A linear combi- 
nation of the Planck single frequency maps (shown in Fig. 8) is taken. This linear combination is 
optimized to filter out any unwanted contaminants based on their differing frequency dependence. 
The success of this procedure to remove the galactic contaminants is evident from the disappear- 
ance of excess power along the equator corresponding to the galactic plane. (For more details, see 
Ref. 161.) (Credit: ESA/Planck Collaboration) (For color version, see page II-CP9.) 


few ballpark numbers characterizing the strength of the primordial CMB temper- 
ature anisotropy. Figure 10 plots the CMB power spectrum as observed by Planck 
together with the predictions of the fit to a six-parameter theoretical model, which 
we will discuss further below. The magnitude at low ¢, before the rise to the first 
acoustic, or Doppler, peak at £ & 220, is De ~ 10° wK?, which would correspond 
to an rms temperature in the neighborhood of 30K. The rise to the first acoustic 
peak increases the power by about a factor of six, and then after approximately 
five oscillations, damping effects take over, making the oscillations less apparent 
and causing the spectrum to suffer a quasi-exponential decay. 

Although the detectors (to the extent that their response is ideal or linear) 
measure the intensity expressed as the “spectral radiance” or “specific intensity” 


I, (having units of erg s~-' cm~? Hz 1 str 1) averaged over a frequency band 
defined by the detector, it is convenient to re-express these intensities in terms of an 
effective temperature. There are two types of effective temperature: the Rayleigh— 
Jeans (R-J) temperature and the thermodynamic temperature. It is important to 
keep in mind the distinction, especially at high frequencies [in this context v > 
vomp = h~'(kgTomp) = 57 GHz] where the factor relating the two becomes large. 

In the R-J limit, where (hv/kpT) « 1, the blackbody spectral radiance may be 
approximated as I, = B,(T) & 2(v/c)?(kpT), as one would obtain classically from 
the density of states of the radiation field and assigning an energy (kp7’) to each 
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Fig. 10. CMB angular power spectrum as measured by Planck. The binned GEE power spectrum 
as given in the Planck 2013 release is plotted with error bars that combine uncertainties from 
cosmic variance (dominant at low @) and instrumental noise (dominant at high @). The solid curve 
indicates the theoretical predictions of the six-parameter concordance model with the 1a cosmic 
variance for the adopted binning scheme. The lower panel shows the residuals. For more details, see 
Ref. 166. The exquisite fit, seen here up to about @ 2200, into the damping tail, has been shown 
to extend to smaller angular scales by ACT®® and SPT.1°? (Credit: ESA/Planck Collaboration) 


harmonic oscillator degree of freedom. If we invert, assuming the R—J limit above 
(regardless of whether this limit is valid), we obtain the following definition for the 
R-J temperature 


Tr-s(v) = Qke (<) I,. (2) 


The “thermodynamic” temperature corresponding to a specific intensity I, at a 
certain frequency, on the other hand, is obtained by inverting the unapproximated 
blackbody expression [, = B,(T) = (2hv3/c?)[exp(hv/kpT) — 1]. For the CMB, 
where the variations about the average CMB temperature are small (ie., AT/T = 
10~°), linearized perturbation theory is valid and one can invert this equation, 
obtaining 


(e? - 1) 


; éTR-J; (3) 


éTcmB = = 
xe 
where x = hy /kgT. This factor is approximately one for x < 1, but rises exponen- 
tially for x > 1. 
It should be remembered that for adding optically thin emissions, it is the inten- 


sities, and not the thermodynamic temperatures, that add. Moreover, foreground 
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emission power laws know nothing about Tcomp and thus are naturally expressed 
using either the specific intensity, or equivalently the R—J temperature. 

Other important milestones of CMB observation include the DASI discovery of 
the polarization of the CMB in 2002!" and the observations at small angular scales 
carried out by the ACT and SPT teams, which at the time of writing constitute 
the best measurements of the microwave anisotropies on large patches of the sky at 
high angular resolution, measuring the power spectrum up to @ = 10,000. The DASI 
experiment was carried out using several feed horns pointed directly at the sky. 
Correlations were taken of the signals from pairs of horns, thus measuring the sky 
signal interferometrically. The 6m diameter Atacama Cosmology Telescope (ACT) 
situated in the Atacama desert in Chile®*1°° and the 10m SPT (South Pole Tele- 
scope) located at the Amundsen-Scott South Pole Station in Antarctical0?:19°:194 
probe the microwave sky on very small angular scales, beyond the angular reso- 
lution of Planck. CMB observations are almost always diffraction limited, so that 
the angular resolution is inversely proportional to the telescope diameter. Ground 
based instruments, despite all their handicaps (i.e., atmospheric interference, lack 
of stability over long timescales, ground pickup from far sidelobes) will always out- 
perform space based experiments in angular resolution. There are dozens of other 
suborbital experiments not mentioned here but which paved the road for contem- 
porary CMB observation. These experiments were important not only because of 
their observations, but also because of their role in technology development and 
in the development of new data analysis techniques. More information on these 
experiments can be found in other earlier reviews.° 


2. Brief Thermal History of the Universe 


The big bang model of the universe is an unfinished story. Certain aspects of the 
big bang model are well established observationally, while other aspects are more 
provisional and represent our present best bet speculation. In the account below, 
we endeavor to distinguish what is relatively certain and what is more speculative. 

If we assume: (i) the correctness of general relativity, (ii) that the universe is 
homogeneous and isotropic on large scales (at least up to the size of that part of 
the universe presently observable to us), and (iii) that for calculating the behav- 
ior of the universe, small-scale anisotropies can be averaged over, we obtain the 
Friedmann—Lamaitre-Robertson—-Walker (FLRW) solutions to the Einstein field 
equations, which we now describe. The metric for this family of solutions takes 
the following form: 


ds* = —dt? + a?(t)yijdx'dz’, (4) 


°See http://lambda.gsfc.nasa.gov/links/experimental_sites.cfm for an extensive compilation of 
CMB experiments with links to their websites. The book!” provides an insightful account empha- 
sizing the early history of CMB observations with contributions from many of the major partici- 
pants. 
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where (i, 7 = 1,2,3) and the line element dé? = y;;dx'dx) describes a maximally 
symmetric three-dimensional space that may be Euclidean (flat), hyperbolic, or 
spherical.¢ For the flat case, which is the most important and most often discussed, 
Eq. (4) takes the form 


ds? = —dt? + a?(t)[dx? + dag” + das”). (5) 


Here a(t) represents the scale factor of the universe. Under the assumption of homo- 
geneity and isotropy, the stress-energy tensor (expressed as a mixed tensor with 
one contravariant and one covariant index) must take the form 


p(t) 0 0 0 
T=! 9 0 pit) 0 (6) 
0 0 0. pit) 


The Einstein field equations Gy, = Ruy — (1/2)gu,R = (87G)T,,, may be reduced 
to the two equations 


He) = SR = Eo - a (7) 
and 
ay =~ goo +3. (8) 


(For more details, see for example the discussion in the books by Weinberg??! or 
Wald.?18) 

The above discussion, which included only general relativity, is incomplete 
because no details concerning the dynamics of p(t) and p(t) have been given. The 
only constraint imposed by general relativity is stress-energy conservation, which 
in the most general case is expressed as T,,,°” = 0, but in the special case above 
takes the form 

dp(t) 


Gr = BH (lol) + PO). (9) 


dMore generally, the three-dimensional line element 


dx? + dxy? + dx32 


(2 (5) (a1? + 22” 4 rs) 


where k = +1,0, or —1 corresponds to a spherical, flat (Euclidean), or hyperbolic three- 
dimensional geometry, respectively. Apart from an overall change of scale, these are the only geome- 
tries satisfying the hypotheses of spatial homogeneity and isotropy. By substituting tan(y/2) = r/2 
or tanh(x/2) = r/2 for the cases k = +1 or k = —1, we may obtain the more familiar 
representations ds? = dy? + sin x7dQ (2)? or ds? = dy? + sinh y7dQ (2), respectively, where 
dQ a)? = dé? + sin 6? d¢?. 


ds? = viz dada! = 
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The relationship between p and p is known as the equation of state, and for the 
special case of a perfect fluid p depends only on p and on no other variables. Two 
special cases are of particular interest: a nonrelativistic fluid, for which p/p = 0, and 
an ultrarelativistic fluid, for which p/p = 1/3. Until the mid-90s it was commonly 
believed that except at the very beginning of the universe, a two-component fluid 
consisting of a radiation component and matter component suffices to account for 
the stress—energy filling the universe. In such a model 


Ae eae (3) vee (3) - ve) 


Here the subscript 0 refers to the present time. For the radiation component, there is 
obviously the contribution from the CMB photons, whose discovery was recounted 
above in Sec. 1. If we assume that the universe started as a plasma at some very 
large redshift with a certain baryon number density, most conveniently parame- 
terized as a baryon-to-photon or baryon-to-entropy ratio, we would conclude that 
there was statistical equilibrium early on between the various species, and from 
this assumption we can calculate the neutrino density today. It turns out that the 
neutrinos are colder than the photons because the neutrinos fell out of statistical 
equilibrium before the electrons and positrons annihilated. The ete~ 
had the effect of heating the photons relative to the neutrinos, because all the 
entropy that was in the electrons and positrons was dumped into the photons. 
Consequently, assuming a minimal neutrino sector, we can calculate the number of 
effective bosonic degrees of freedom due to the cosmic neutrino background, which 
according to big bang theory must be present but has never been detected. Measur- 
ing the number of baryons in the universe today is more difficult, and is the subject 
of an extensive literature. The realization of a need for some additional nonbaryonic 
“dark” matter dates back to Fritz Zwicky’s study of the Coma cluster in 1933 and 
is a long story that we do not have time to go into. Today “cold dark matter” is 
part of the concordance model, where the name simply means that whatever the 


annihilation 


details of this extra component may be (the lightest supersymmetric partner of the 
ordinary particles in the standard electroweak model, the axion, or yet something 
else), for analyzing structure formation in the universe, we can treat this component 
as nonrelativistic (cold) particles that can be idealized as interacting only through 
gravitation. 

In the mid-1990s it was realized that a model where the stress—energy included 
only matter and radiation components could not account for the observations. A 
crucial observation for many researchers was the measurement of the apparent 
luminosities of Type Ia supernovae as a function of redshift,!°4!9? although at that 
time there were already several other discordant observations indicating that the 
universe with only matter and radiation could not account for the observations.© To 


©See for example some of the contributions in Ref. 213 and references therein. 
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reconcile the observations with theory, another component having a large negative 
pressure, such as would arise from a cosmological constant, was required. 

A cosmological constant makes a contribution to the stress—energy T),,, propor- 
tional to the metric tensor g,,, meaning that p = —p exactly. This relation inserted 
into the right-hand side of Eq. (9) implies that any expansion (or contraction) does 
not change the contribution to the density from the cosmological constant. 

With a cosmological constant, the density of the universe scales in the following 


AK (a) Te (a) eee (3) - my 


The cosmological constant is most important at late times, when its density rapidly 


comes to dominate over the contribution from nonrelativistic matter (consisting 
according to our best current understanding of baryons and a so-called cold dark 
matter component). It is customary to express the contributions of each component 
as a fraction of the critical density poit = 3H?/87G, which would equal the total 
density if the universe were exactly spatially flat. For the ith component, we define 
04 = (p:/ Perit). 

From the present data, we do not know whether the new component (or compo- 
nents) with a large negative pressure is a cosmological constant, although current 
observations indicate that w = p/p for this component must lie somewhere near 
w = —1. A key question of many of the major initiatives of contemporary observa- 
tional cosmology is to measure w (and its evolution in time) in order to detect a 
deviation of the behavior of the dark energy from that predicted by a cosmological 
constant. 

The earliest probe of the hot big bang model arises from comparing the primor- 
dial light element abundances as inferred from observations to the theory of primor- 
dial nucleosynthesis. In a hot big bang scenario, there is only one free parameter the 
baryon-to-photon ratio 7p, or equivalently, the baryon-to-entropy ratio 7s, which is 
related to the former by a constant factor. As discussed above, in the hot big bang 
theory at early times, in the limit as a(t) — 0, we have p > oo, and T — oo. For 
calculating the primordial light element abundances, we do not need to extrapolate 
all the way back to the initial singularity, where these quantities diverge. Rather 
it suffices to begin the calculation at a temperature sufficiently low so that all the 
baryon number of the universe is concentrated in nucleons (rather than in free 
quarks), but high enough so that there are only protons and neutrons rather than 
bound nuclei consisting of several nucleons, and so that the reaction 


pte =nt+kk, (12) 


(as well as related reactions) proceeds at a rate faster than the expansion rate of the 
universe H. Under this assumption, the ratio of protons to neutrons is determined 
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by an equilibrium condition expressed in terms of chemical potentials 


u(p) + wle~) = p(n) + w(ve) + AM, (13) 
where 


AM = My — Mp — M = 939.56 MeV — 938.27 MeV — 511 KeV = 0.78 MeV. 
(14) 


At first, when T >> (AM), the protons and neutrons have almost precisely equal 
abundances, but then, as the universe cools down and expands, the neutrons become 
less abundant than the protons owing to the neutron’s slightly greater mass. Then 
the reactions (12) freeze out — that is, their rate becomes small compared to H — 
and the neutron—proton ratio becomes frozen in. Subsequently protons and neutrons 
fuse to form deuterium, almost all of which combines to form “He, either directly 
from a pair of deuterons or through first forming *He, which subsequently fuses 
with a free neutron. Almost all the neutrons that do not decay end up in the form 
“He, because of its large binding energy relative to other low A nuclei, but trace 
amounts of other light elements are also produced. Even though other heavier nuclei 
have a larger binding energy per nucleon and their production would be favored by 
equilibrium considerations, the rates for their production are too small, principally 
because of the large Coulomb barrier that must be overcome. 

In primordial big bang nucleosynthesis (BBN) the only free parameter is 7p. 
Since nucleosynthesis takes place far into the epoch of radiation domination, 
the density as a function of temperature is determined by the equation p(T’) = 
(x?/30)NyeiT'* where N;e) is the effective number of bosonic degrees of freedom. 
The expansion rate then is important because it determines for example how many 
neutrons are able to be integrated into nuclei heavier than hydrogen before decaying. 
Based on the particles known to us that are ultrarelativistic during nucleosynthe- 
sis, the photon, electron, positron, neutrinos, and antineutrinos, we think that we 
know what value N;.; should have. But nucleosynthesis can be used to test for the 
presence of extra relativistic degrees of freedom. We shall see later that the CMB 
can also be used to constrain N,,). For a review with more details about primordial 
nucleosynthesis, see Ref. 219. 


3. Cosmological Perturbation Theory: Describing a Nearly Perfect 
Universe Using General Relativity 


The broad brush account of the history of the universe presented in the previous 
section tells an average story, where spatial homogeneity and isotropy have been 
assumed. As we shall see, early on, at large z, this story is not far from the truth, 
especially on large scales. But a universe that is exactly homogeneous and isotropic 
would be quite unlike our own, in that there would be no clustering of matter, 
observed very concretely in the form of galaxies, clusters of galaxies, and so forth. 


Physics of the cosmic microwave background anisotropy TI-59 


In this section, we present a brief sketch of the theory of cosmological perturba- 
tions. For excellent reviews with a thorough discussion of the theory of cosmological 
perturbations in the framework of general relativity, see Refs. 103 and 139. 

For the “scalar” perturbations, we may write the line element for the metric 
with its linearized perturbations ®(x,7) and V(x,7) in the following form: 


ds* = a?(n)[—(1 + 26)dn? + (1 — 2W)dx?]. (15) 


The functions Y(x,7) and ®(x,7) are the Newtonian gravitational potentials. At 
low velocities ®(x,17) is most relevant, but Y(x,7) can be probed observationally 
using light or some other type of ultrarelativistic particles. When there are no 
anisotropic stresses (i.e., all the partial pressures of the stress—energy tensor are 
equal), d= WU. 

For a universe filled with a perfect fluid with an equation of state p = p(p), 
the evolution of the Newtonian gravitational potential @(x,17) is governed by the 
equation (see Ref. 139 for a derivation and more details) 


@" + 3H(1+c,7)®' —c,?V?® + [27' + (1 + 3c,7)H7]6 = 0, (16) 


where c,” = dp/dp, H = a’/a, and‘ = d/dn. Here 7 is the conformal time, related 
to the more physical proper time by the relation dt = a(n)dn. 
If the spatial derivative term (i.e., V2®) is neglected, which is an approximation 


valid on superhorizon scales, then the derived quantity!®*% 
2(@©+H'6’) 
= 4 17 
¢ 3 (l+w) ay) 


(where w = p/p) is conserved on superhorizon scales. This approximate invariant, 
which can be related to the spatial curvature of the surfaces of constant density, is 
invaluable for relating perturbations at the end of inflation (or some other very early 
noninflationary epoch) to the later times of interest in this chapter. This property 
is particularly useful given our ignorance of the intervening epoch, the period of 
“reheating” or “entropy generation,” the details of which are unknown and highly 
model dependent. 

To understand the qualitative behavior of the above equation, it is important to 
focus on the role of the Hubble parameter H(t) = a(t)/a(t), which at time ¢ defines 
the dividing line between superhorizon modes |for which |kpnys| < H(t)] and sub- 
horizon modes [for which |kpnys| > H(t)].2 This distinction relates to the relevance 


0 66. ? 


In the theory of cosmological perturbations, the terms “scalar,” “vector,” and “tensor” are given 
a special meaning. A tensor of any rank that can be constructed using derivatives acting on a 
scalar function and the Kronecker delta multiplying a scalar function is regarded as a “scalar”. 
A quantity that is not a “scalar” but can be constructed in the same way from a 3-vector field 
is regarded as “vector,” and a “tensor” is a second-rank tensor with its “scalar” and “vector” 
components removed. Under these definitions, the linearized perturbation equations reduce to a 
block diagonal form in which the “scalar,” “vector,” and “tensor” sectors do not talk to each 
other. 

Here “horizon” means “apparent horizon,” which depends only on the instantaneous expansion 
rate, unlike the “causal horizon,” which depends on the entire previous expansion history. 
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of the spatial gradient terms in the evolution equations for the cosmological pertur- 
bations. This same distinction may be re-expressed in terms of the conformal time 
7 and the co-moving wavenumber k, where 1 = a’/a takes the place of H = a/a, 
with superhorizon meaning |k| < H(t) and subhorizon meaning |k| > H(t). Here 
dots denote derivatives with respect to proper time whereas primes denote deriva- 
tives with respect to conformal time. It turns out that for calculations, working 
with conformal time and co-moving wavenumbers is much more convenient than 
working with physical variables. 

What happens to the perturbations as the universe expands depends crucially 
on whether the co-moving horizon size is expanding or contracting. During infla- 
tion, or more generally when w < —1/3, the co-moving horizon size is contracting. 
This means that the dynamics of Fourier modes, which initially hardly felt the 
expansion of the universe, become more and more affected by the expansion of the 
universe. Terms proportional to H and H? in the evolution equations are becoming 
increasingly relevant whereas the spatial gradient terms are becoming increasingly 
irrelevant as the mode “exits” the horizon, finally to become completely “frozen 
in” on superhorizon scales. During inflation a mode starts far within the horizon 
and crosses the horizon at some moment during inflation. At the end of inflation, 
all the modes of interest lie frozen in, far outside the horizon. 

During inflation, w = p/p was slightly more positive than —1. Formally inflation 
ends when w crosses the milestone w = —1/3, which is when the co-moving horizon 
stops shrinking and starts to expand. This moment roughly corresponds to the onset 
of “reheating” or “entropy generation,” when the vacuum energy of the inflaton 
field gets converted into radiation, or equivalently ultrarelativistic particles, which 
afterward presumably constitute the dominant contribution to the stress—energy 
T,,”, so that w & 1/3. During the radiation epoch following “entropy generation,” 
modes successively re-enter the horizon again becoming dynamical. The degrees of 
freedom describing the modes are not the same as previously during inflation. This 
is at present our best-bet story of what likely happened in the very early universe. 
In this chapter, we shall be interested in the later history of the universe, when 
modes are re-entering the horizon. 

We shall not enter into the details of how the primordial perturbations are gen- 
erated from quantum fluctuations of the vacuum during inflation, instead referring 
the reader to the chapter by K. Sato and J. Yokoyama on Inflationary Cosmol- 
ogy for this early part of the story. Nor shall we discuss alternatives to inflation. 
The original papers in which the scalar perturbations generated from inflation were 
first calculated include Mukhanov,'*° 1°* Hawking,®® Guth,” Starobinsky,?°? and 
Bardeen et al.!! For early discussion of the generation of gravity waves during infla- 
tion and their subsequent imprint on the CMB, see Refs. 258-260, 3, and 261. A 
more pedagogical account may be found for example in the books by Liddle and 
Lyth* and by Peacock,!6 as well as in several review articles, including for example 
Ref. 126. 
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We emphasize the simplest type of cosmological perturbations, known as pri- 
mordial “adiabatic” perturbations. The fact that the perturbations are “primordial” 
means that at the late times of interest to us, only the growing adiabatic mode is 
present, because whatever the amplitude of the decaying mode may initially have 
been, sufficient time has passed for its amplitude to decay away and thus become 
irrelevant. The term “adiabatic” deserves some explanation because its customary 
meaning in thermodynamics does not precisely correspond to how it is used in the 
context of cosmological perturbations. In a universe with only adiabatic perturba- 
tions, at early times all the components initially shared a common equation of state 
and common velocity field, and moreover on surfaces of constant total density, the 
partial densities of the components are also spatially constant. This is the state of 
affairs initially, on superhorizon scales, but subsequently after the modes enter the 
horizon, the different components can and do separate. 

Adiabatic modes are the simplest possibility for the primordial cosmological 
perturbations, but they are not the only possibility. Isocurvature perturbations 
where the equation of state varies spatially are also possible, as discussed in detail 
for example in Refs. 22, 28 and 257 references therein. For isocurvature perturba- 
tions, the ratios of components vary on a constant density surface, on which the 
spatial curvature is constant as well. However the ratios between the components 
are allowed to vary. As the universe expands, these variations in the equation of 
state eventually also generate curvature perturbations. It might be argued that the 
isocurvature modes should generically be expected to have been excited with some 
nonvanishing amplitude in multi-field models of inflation. For recent constraints 
on isocurvature perturbations from the CMB, see Planck Inflation 201317! and the 
extensive list of references therein. 


4. Characterizing the Primordial Power Spectrum 


In the previous section, we defined the dimensionless invariant ¢(x), conserved on 
superhorizon scales, which can be used to characterize the primordial cosmological 
perturbations at some convenient moment in the early universe when all the relevant 
scales of interest lie far outside the horizon. Because this quantity is conserved on 
superhorizon scales, the precise moment chosen is arbitrary within a certain range. 
The existence of a quantity conserved on superhorizon scales allows us to make 
precise predictions despite our nearly complete ignorance of the details of reheating. 

Assuming a spatially flat universe, we may expand ¢(x) into Fourier modes, so 
that 


3 
¢(x) = / aayestk) expfik x]. (18) 


If spatial homogeneity and isotropy are assumed, the correlation function of the 
Fourier coefficients must take the form 


(C(k)¢(k’)) = (21)*k-°P(k)5° (k — k’), (19) 
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which also serves as the definition of the function P(k&) known as the primordial 
power spectrum. The k~? factor is present so that P(k) is likewise dimensionless. 
An exactly scale invariant power spectrum corresponds to P(k) ~ k°. The meaning 
of scale invariance can be understood by writing the mean square fluctuation at a 
point as the following integral over wavenumber: 


3 _ _ +oo 
(2(x =0)) = / d{in(k)|P(k). (20) 


Changes of scale correspond to rigid translations in In(k), and the only choice for 
P(k) that does not single out a particular scale is P(k) = (constant). 


5. Recombination, Blackbody Spectrum, and Spectral Distortions 


It is sometimes said that the CMB is the best blackbody in the universe. This 
is not true. Observationally the only way to test how close the CMB is from a 
perfect blackbody is to construct a possibly better artificial blackbody and perform 
a differential measurement between the emission from the sky and this artificial 
blackbody as a function of frequency. 

In our discussion of the CMB, we stressed that theory predicted a frequency 
spectrum having a blackbody form to a very high accuracy. This indeed was one 
of the striking predictions of the big bang theory, and the lack of any measurable 
deviation from the perfect blackbody spectral shape over a broad range of frequen- 
cies was probably one of the most important observational facts that led to the 
demise of the alternative steady state cosmological model. To date, the best mea- 
surements of the frequency spectrum of the CMB are still those made by the COBE 
FIRAS instrument.® 191,132 FTRAS!33 made differential measurements comparing 
the CMB frequency spectrum on the sky to an artificial blackbody. Despite the 
many years that have gone by since COBE and the importance of measuring the 
absolute spectrum of the CMB, no better measurement has been carried out because 
improving on FIRAS would require going to space in order to avoid the spectral 
imprint of the Earth’s atmosphere. However a space mission concept called PIXIE 
has been proposed that would essentially redo FIRAS but with over two orders of 
magnitude better sensitivity and with polarization sensitivity included.!0> While 
differences in temperature between different directions in the sky, particularly on 
small angular scales, can be measured from the ground, albeit with great difficulty, 
the same is not possible for measurements of the absolute spectrum. 

From a theoretical point of view, even in the simplest minimal cosmological 
model with no new physics, deviations from a perfect blackbody spectrum are pre- 
dicted that could be measured with improved observations of the absolute spectrum 
having a sensitivity significantly beyond that of FIRAS. It is often believed that 
when we look at the CMB, we are probing conditions in the universe at around 
last scattering — that is, around z = 1100 — and that the CMB photons are 
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unaffected by what happened during significantly earlier epochs. This is not true. 
Although when z > 1100, photons are frequently scattered by electrons random- 
izing their direction and causing them to move diffusively, this frequent scattering 
is inefficient at equilibrating the kinetic temperature of the electrons to the energy 
spectrum of the photons and vice versa owing to the smallness of the dimension- 
less parameter E,/m,-, where E, is a typical photon energy. This ratio gives the 
order of magnitude of the fractional energy exchange due to electron recoil for a 
typical collision. Given that in this limit the approach to a blackbody spectrum is 
a diffusive process, we would estimate that (E,/m,)* collisions correspond to one 
decay time of the deviation of the photon energy spectrum from the equilibrium 
spectrum. Plugging in FE, = kpTz=1100 © 0.3eV, we get (Ey/m-)? ~ 4 x 10'°! 
Thomson scattering, however, cannot equilibrate the photon number density with 
the available energy density, so with Thomson scattering alone, an initially out of 
equilibrium photon energy distribution, for example as the result of some sort of 
energy injection, would generically settle down to a photon phase space distribu- 
tion having a positive chemical potential. Processes such as Bremsstrahlung and its 
inverse or double Compton scattering are required to make the chemical potential 
decay to zero, and these processes freeze out at z ~ 10°.97 Therefore any energy 
injection between z ~ 10° and z © 10 will leave its mark on the absolute spectrum 
of the CMB photons, either in the form of a so-called py distortion, or for energy 
injected at later times, in the form of a more complicated energy dependence. 

A broad range of interesting early universe science can be explored by searching 
for deviations from a perfect blackbody spectrum. Some of the spectral distortions 
are nearly certain to be present, while others are of a more speculative nature. On 
the one hand, y-distortions in the field (that is, away from galaxy clusters where the 
y-distortion is localized and particularly large) constitute a nearly certain signal, 
as do the spectral lines from cosmological recombination. But sources like decaying 
dark matter*! may or may not be present. Another interesting source of energy 
injection arise from Silk damping?!2® of the primordial power spectrum on small 
scales, beyond the range of scales where the primordial power spectrum can be 
probed by other means. If a space-based instrument is deployed with the requisite 
sensitivity, a formidable challenge will be to distinguish these signals from each 
other, and also from the more mundane galactic and extragalactic backgrounds. 
From this discussion one might conclude, borrowing terminology from stellar astro- 
physics, that the CMB “photosphere” extends back to around z ~ 10°. 


6. Sachs—Wolfe Formula and More Exact Anisotropy Calculations 


At early times (z < 1100) the universe is almost completely ionized and the pho- 
tons scatter frequently with free charged particles, primarily with electrons because 
the proton—photon scattering cross-section is suppressed by a factor of (me/my,). 
During this period the plasma consisting of baryons, leptons, and photons may be 


TI-64 M. Bucher 


regarded as a single tightly-coupled fluid component almost inviscid but with a 
viscosity relevant on the shortest scales of interest for the calculation of the CMB 
temperature and polarization anisotropies. Later, as the universe cools the electrons 
and protons (and other nuclei) combine to form neutral atoms whose size greatly 
exceeds the thermal wavelength of the photons, rendering the universe almost trans- 
parent. Subsequent to this process, known as “recombination” (despite the fact that 
the universe had never previously been neutral), the photons cease to rescatter and 
free stream toward us today.!*° This last statement is only 90% accurate, because 
the late-time reionization starting at about z ~ 6-7 causes about 10% of the CMB 
photons to be rescattered. The effect of this rescattering on the CMB anisotropies 
will be discussed in detail in Sec. 7.5. It is during this transitional epoch, situated 
around z & 1100, that the CMB temperature and polarization anisotropies were 
imprinted. Thus when we observe the microwave anisotropies on the sky today, we 
are probing the physical conditions on a sphere approximately 47 billion light years 
in radius today,> or more precisely the intersection of our past light cone at the 
surface of constant cosmic time at z ~ 1100. This sphere is known as the surface of 
last scatter or the last scattering surface. 

The transition from a universe that is completely opaque to the blackbody 
photons, in which the tight-coupling approximation holds for the baryon—lepton 
plasma because the photon mean free path is negligible, to a universe that is vir- 
tually transparent is not instantaneous. This fact implies that the last scattering 
surface (LSS) is not infinitely thin, rather having a finite width that must be taken 
into account for calculating the small angle CMB anisotropies, because this profile 
of finite thickness smears out the small-scale three-dimensional inhomogeneities as 
they are projected onto the two-dimensional celestial sphere. Figure 11 shows a 
plot of the ionization history of the universe (not taking into account the late-time 
reionization at z > 6 mentioned above). (For a more detailed discussion of the early 
ionization history of the universe and the physics by which it is determined, see for 
example Refs. 87, 196, 197, 211 and 30). 

The first calculation of the CMB temperature anisotropies predicted in a uni- 
verse with linearized cosmological perturbations was given by Sachs and Wolfe in 
their classic 1967 paper!® (see also Ref. 150). In their treatment, the LSS surface 
is idealized as the surface of a three-dimensional sphere — in other words, the tran- 
sition from tight-coupling to transparency is idealized to be instantaneous. Locally, 
on this surface the photon—baryon fluid is subject to two kinds of perturbations: 
(i) perturbations in density 5,_» and (ii) velocity perturbations v,. The former 
translate into fluctuations in the photon blackbody temperature T,, into “intrin- 
sic” temperature fluctuations at the last scattering surface with dT,/T, and the 
second translate into a Doppler shift of the CMB temperature. If there were no 


hThe radius of this sphere in terms of today’s comoving units is larger than the age of the universe 
(approximately 13.8 Gyr) converted into a distance owing to the expansion of the universe. 
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Fig. 11. Ionization fraction as function of redshift with effect of late-time ionizing radiation 
ignored. Calculated using the code CosmoRec by Jens Chluba (for details see, Ref. 262). 


metric perturbations, we would simply have 


oT (Q) _ 9TH) _ 1s, g (21) 
Ty T; Cc 


where (? is the unit outward normal on the last scattering surface. But there are also 
additional terms due to the metric perturbations, and careful calculations along the 
perturbed geodesics yield the following modification to the above equation: 
6T,(Q) _ 6T;(Q) 
Tp oY; 


” 
“6¥, -Q4 O64 : dn(®’ +’), (22) 
ui 

where the metric perturbations are parametrized as in Eq. (15) using conformal 
Newtonian gauge.'! This equation is known as the Sachs—Wolfe formula. Because 
of the approximations involved, the Sachs—Wolfe formula is not used for accurate 
calculations (especially on small angular scales). However, it offers a good approx- 
imation to the CMB anisotropies on large angular scales and provides invaluable 
intuition that is lacking in more precise treatments. 

A few words about gauge dependence,’ an issue that renders linearized pertur- 
bation theory within the framework of general relativity somewhat messy. While the 
final observed anisotropy 6T;(Q)/T; is the same in all coordinate systems (except 


iThe Sachs—Wolfe formula as given here includes only the “scalar” perturbations. The formula can 
be generalized to include “vector” and “tensor” contributions, for which the only contributions 
are from the Doppler and ISW (integrated Sachs-Wolfe) terms. 

J“Gauge dependence” here means invariance under general coordinate transformations, which 
(consistent with the linear approximation above) may be truncated at linear order. 
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for the monopole and dipole terms), the attribution of the total anisotropy among 
the various terms (i.e., intrinsic temperature fluctuation, Doppler, gravitational 
redshift, integrated Sachs—Wolfe) depends on the choice of coordinates. We may 
for example choose coordinates so that the photon temperature serves as the time 
coordinate, making the first term disappear, or alternatively, we may make the 
Doppler term disappear by making our co-moving observers move with the local 
photon rest frame. 

Students of cosmology are sometimes misled into believing the solution to these 
ambiguities is to use conformal Newtonian gauge, or equivalently the “gauge invari- 
ant” formalism, which is equivalent to transforming to conformal Newtonian gauge. 
It is believed that this gauge choice is somehow more “physical” because at least to 
linear order, the gauge conditions lead to a unique choice of gauge. It is certainly 
nice to have a gauge condition leading to a unique choice of coordinates, but this 
uniqueness comes at a price. The Newtonian gauge condition is “nonlocal” because 
of its reliance on the decomposition of h,, into “scalar”, “vector”, and “tensor” 
components, which requires information extending all the way out to spatial infin- 
ity. But no one has ever seen all the way out to spatial infinity. For this reason, the 
Newtonian potentials UV and ® are unphysical because they cannot be measured. 
Their determination would require information extending beyond our horizon. 

We may further simplify Eq. (22) assuming that only the adiabatic growing 
mode has been excited. Using the fact that 67; and v are not independent of ®, we 
obtain the often cited result 

6T ® 


773 vi fan(o' +0), (23) 


although the —1/3 factor is not quite right because a matter-dominated evolution 

for a(t) is assumed rather than a more careful treatment taking into account matter- 

radiation equality occurring at around zeq * 3280. (For a discussion see Ref. 224.) 
Using this result and ignoring the integrated Sachs—Wolfe term, we obtain 


{Qa = yo 0), (24) 


On large angular scales — that is, large compared to the angle subtended by the 
horizon on the last scattering surface — the first term dominates. The modes labeled 
by k obey an oscillatory system of coupled ODEs, and at the putative big bang 
each mode starts with a definite sharp phase corresponding to the part of the cycle 
where v = 0. It is only at horizon crossing that the phase has evolved sufficiently 
for the velocity term to contribute appreciably to the CMB anisotropy. Figure 12 
shows the calculated form of the CMB temperature power spectrum. For @ < 100, 
AT/T = —®/3 provides an adequate explanation for this leftmost plateau of the 
curve, but at higher @ a rise to a first acoustic peak, situated at about ¢ 220, is 
observed followed by a series of decaying secondary oscillations. 
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Fig. 12. Theoretical CMB spectrum. CMB temperature power spectrum predicted for a model 
with only the adiabatic growing mode excited, as in the standard concordance cosmological model, 
is shown. The theoretical model here assumes the best-fit cosmological parameters taken from the 
Planck 2013 Results for Cosmology.!66 


A more accurate integration of the evolution of the adiabatic mode up to last 
scattering can be obtained in the fluid approximation. This approximation assumes 
that the stress-energy content of the universe can be described by a fluid descrip- 
tion consisting of two components coupled to each other only by gravity.!°? However 
more accurate calculations must go beyond the fluid approximation using a Boltz- 
mann formalism that includes higher order moments. 

However, before describing the Boltzmann formalism, capable of describing this 
intermediate regime for the photons, we note one final shortcoming of the approxi- 
mation with tight-coupling and instantaneous recombination — namely, the treat- 
ment of polarization. We present here an approximate, heuristic treatment in order 
to provide intuition. 

The scattering of photons by electrons is polarization dependent, and this effect 
leads to a polarization of the CMB anisotropy when the fact that recombination 
does not occur instantaneously is taken into account. The Thomson scattering 
cross-section of a photon off an electron is given by 


do e2 \? 2 
— = (—; é,- Ef)”, 25 
aa ~ (Sa) en (25) 
where €; and €, are the initial and final polarization vectors, respectively. 

To see qualitatively how the polarization of the CMB comes about, let us 
for the moment assume that the photons coming from next-to-last scattering are 
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Fig. 13. Anisotropy of polarized Thomson scattering and the origin of the CMB polarization. 
We show two examples of photon scattering from (NLS) (next-to-last scattering) to (LS) (last 
scattering) and finally to the observer (at O). The three segments in the diagram have been 
chosen mutually at right angles in order to maximize the effect, and for simplicity we assume 
that the radiation emanating from (NLS); and (NLS)2 is unpolarized. All the radiation that 
scatters coming from (NLS); to (LS) and then towards O is completely linearly polarized in the 
€2 direction, because the other polarization is parallel to (LS)O and thus cannot be scattered in 
that direction. Likewise, the scattered photons emanating from (NLS)2 are completely linearly 
polarized in the orthogonal direction for the same reason. Consequently if there were only two 
sources as above, measuring the Stokes parameter Q at O amounts to measuring the difference in 
intensity between these two sources. 


unpolarized and calculate the polarization introduced at last scattering, as indi- 
cated in Fig. 13. Measuring photons with one linear polarization selects photons 
that propagated from next-to-last to last scattering at a small angle to the axis of 
linear polarization, while photons with the other linear polarization tend to come 
from a direction from next-to-last to last scattering with a small angle with the 
other axis. Consequently, measuring the polarization amounts to measuring the 
temperature quadrupole as seen by the electron of last scattering. If we make the 
simplifying assumption that the radiation emanating from next-to-last scattering 
is unpolarized, we obtain the following expression for the Stokes parameters from 
the linear polarization, 


Q Po Pmas a, cos @ 
a 7 = [, d(—exp[—T(r1.s. +))) [a0 sin? 6 cs _ 


x T(psin 6 cos ¢, psin @ sin ¢, ris. + pcos8). (26) 


Here we assume that the line of sight is along the z direction. The expression gives 
two of the five components of quadrupole moment of the temperature as seen by 
the electron at last scattering, and after integration along the line of sight would 
give the total linear polarization with the polarization at next-to-last scattering 
neglected. 
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7. What Can We Learn From the CMB Temperature and 
Polarization Anisotropies? 


The previous section showed how starting from Gaussian isotropic and homoge- 
neous initial conditions on superhorizon scales, the predicted CMB temperature 
and polarization anisotropies are calculated, describing in detail all the relevant 
physical processes at play. In this section, we turn to the question of what we 
can learn about the universe from these observations. We focus on how to exploit 
the temperature and polarization two-point functions, which under the assumption 
of Gaussianity would summarize all the available information characterizing the 
underlying stochastic process. Gaussianity is a hypothesis to be tested using the 
data, as discussed separately in Sec. 9. 


7.1. Character of primordial perturbations: Adiabatic growing 
mode versus field ordering 


In the aftermath of the COBE/DMR announcement of the detection of the CMB 
temperature anisotropy, two paradigms offered competing explanations for the ori- 
gin of structure in the early universe. On the one hand, there was cosmic inflation, 
which in its simplest incarnations predicts homogeneous and isotropic Gaussian ini- 
tial perturbations where only the adiabatic growing mode is excited. On the other 
hand, there was also another class of models in which the universe is postulated 
initially to have been perfectly homogeneous and isotropic. However, subsequently 
a symmetry breaking phase transition takes place, with the order parameter field 
taking uncorrelated values in causally disconnected regions of spacetime. Then, as 
the universe expands and the co-moving size of the horizon grows, the field aligns 
itself over domains of increasingly large co-moving size, generally in a self-similar 
way. In many of these models, topological defects of varying co-dimension — such 
as monopoles, cosmic strings, and domain walls — arise after the phase transition, 
but textures where the spatial gradient energy is more diffusely distributed are also 
possible. In these models the contribution of the field ordering sector to the total 
energy is always subdominant, and cosmological perturbations are generated in a 
continuous manner extending to the present time. The stress—energy 0, from this 
sector sources perturbations in the metric perturbation h,,. These metric pertur- 
bations in turn generate perturbations in the dominant components contributing to 
the stress—energy: the baryons, photons, neutrinos, and cold dark matter. In these 
models, the cosmological perturbations are not primordial, but rather are continu- 
ously generated so that the perturbations on the scale k are generated primarily as 
the mode k enters the apparent horizon.!°? This property implies that the decay- 
ing mode as well as the growing mode of the adiabatic perturbations is excited, 
and this fact has a spectacular effect on the shape of the predicted CMB power 


kFor the physics of topological defects, see Ref. 184, and for a comprehensive account emphasizing 
the connection to cosmology, see Ref. 217. 
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spectrum. While inflationary models, which excite only the growing mode, predict 
a series of sharp, well-defined acoustic oscillations, field ordering models predict 
broad, washed out oscillations or no oscillations at all in the angular CMB power 
spectrum.*°!5! Heuristically one can understand this behavior by arguing that the 
positions of the peaks reflect the phase of the oscillations. Therefore, if both the 
growing and decaying modes are present in exactly equal proportion, there should 
be no oscillations in the angular spectrum. However, precise predictions require 
difficult numerical simulations and depend on the precise model for the topological 
defect or field ordering sector. Figure 14 shows the shape of the predicted CMB TT 
power spectra for some field ordering models, to be compared with the predictions 
for minimal inflation shown in Fig. 12. 

After COBE, a big question was whether improved degree-scale CMB obser- 
vations would unveil the first Doppler peak followed by a succession of decaying 
secondary peaks at smaller angular scales, as predicted by an approximately scale 
invariant primordial power spectrum for the growing adiabatic mode, or whether 
some other shape would be observed, perhaps the one predicted by a field ordering 
model. 
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Fig. 14. Structure of acoustic oscillations: field ordering predictions. These acoustic oscillation 
predictions from four field-ordering models are to be contrasted with the sharp, well-defined peaks 


in Fig. 12 as predicted by inflation. (Reprinted with permission from Ref. 151.) (Credit: Pen, Seljak 
and Turok) 
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We already presented (in Sec. 1) the experimental results for the first observa- 
tions of the acoustic oscillations and their subsequent precise mapping (see in partic- 
ular Figs. 4, 5 and 10.) The data clearly favor simple models of inflation producing 
adiabatic growing mode perturbations with an approximately scale invariant power 
spectrum and exclude scenarios where topological defects serve as the primary 
source for the initial density perturbations. However, models including a small 
admixture of defect induced perturbations cannot be ruled out.!”4 


7.2. Boltzmann hierarchy evolution 


In the previous section, we derived the Sachs—Wolfe formula, which provides an 
intuitive understanding of how the CMB anisotropies are imprinted. The Sachs— 
Wolfe treatment provides an approximate calculation of the CMB temperature 
anisotropies, correctly capturing their qualitative features. However to calculate 
predictions accurate at the sub-percent level, as is required for confronting current 
observations to theoretical models, the transition from the tight-coupling regime 
to transparency, or the free-streaming regime, must be modeled more realistically. 
This requires going beyond the fluid approximation, where the photon gas can be 
described by its first few moments. Modeling this transition faithfully requires a 
formalism with an infinite number of moments — that is, a formalism in which the 
fundamental dynamical variable is the photon phase space distribution function 
f(x,t,v,n, €) where v is the photon frequency, and the unit vectors n and € denote 
the photon propagation direction and electric field polarization, respectively. The 
perturbations about a perfect blackbody spectrum are small, so truncating at lin- 
ear order provides an adequate approximation. Moreover, Thomson scattering is 
independent of frequency, and thus (to linear order) does not alter the blackbody 
character of the spectrum. Consequently, a description in terms of a perturbation 
of the blackbody temperature that depends on spacetime position, propagation 
direction, and polarization direction can be used in place of the full six-dimensional 
phase space density, thus reducing the number of dimensions of the phase space by 
one.! For a calculation correct to second order, a description in terms of a perturbed 
temperature would not suffice. 

As before we consider a single “scalar” Fourier mode k of flat three-dimensional 
space, so that an exp[ik-x] spatial dependence is implied. A complication in 
describing polarization arises in specifying a basis, which preferably is accomplished 
using only scalar quantities. We describe the polarization using the unit vectors n 
and k to define the unit vectors 


’ (Sees (27) 
n xX 


‘Although written as a continuum variable, é is in reality a discrete variable corresponding to the 
Stokes parameters (J, Q and U). 
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which together with photon propagation direction n form an orthonormal basis. 
We thus describe the Stokes parameters of the radiation propagating along ni as 
follows: 


(Eq (fi, v)Ey (a, v)) = I(A, v;k) ban + Q(, v;k)[O @ 6 — 6 @ Plan. (28) 


For the “scalar” mode, the Stokes parameters U and V vanish in this basis. 

The Thomson scattering cross-section does not vary with frequency, a property 
that provides substantial simplification. Suppose that at some initial time the pho- 
ton distribution function has a blackbody dependence on frequency, with a black- 
body temperature suffering only small linearized perturbations about an average 
temperature, and that these temperature perturbations depend on the spacetime 
position, propagation direction, and linear polarization direction. The frequency 
independence of Thomson scattering ensures that a photon distribution function 
initially having this special form will retain this special form during its subse- 
quent time evolution. Initially, when tight-coupling is an excellent approximation, 
any deviation in the frequency dependence from a perfect blackbody spectrum 
decays rapidly. Moreover, any initial polarization is erased by the frequent scat- 
tering. Therefore a blackbody form described by a perturbation in the spectral 
radiance of the form 


618,23, 0.S 7, A ali, xx, tee” (29) 


is well motivated. Here 


BT) = (=) (ew | : i) (30) 


is the spectral radiance for a blackbody at temperature 7’. The unperturbed spectral 
radiance is [,,o(n,€,x,t) = B,(T,(t)), where T,(t) is the photon temperature at 
cosmic time t in the unperturbed background expanding FLRW spacetime. (In 
Sec. 5, however, we discuss some interesting possible caveats to this assumption.) 
For the “scalar” mode of wavenumber k, we may decompose 


Aao(a, x, t) = AS (A, x, t)6an + AS (A, x, t) (6295 — bad). (31) 


We now turn to the evolution of the variables AS) (n,t) and A®&) (A, t). It is 
convenient, exploiting the symmetry under rotations about k, to expand 
AP? (atk) = > APD (k)(—a! (26+ 1) Pelk- A), 
l= 
° (32) 


AS (i, 3k) = S> ALP (tk) (—i)*(2l + 1) Pe(k- A). 
£=0 
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The evolution of NSS 


equations!?’: 


and AC is governed by the following infinite system of 


scl)’ scl 
AY —kAS +¢, 


scl)’ k scl scl scl 
Ag” = S (AGS? — 2055 + 4) + a(t)ne(tor (2 - AM), 


scl)’ k scl scl II scl 
NSS — 5 (2Ar )- oar ’) + a(t)ne(thor (5+ + ANS ') 
scl)’ k (scl scl sc 
AG — ara fr e_ay — CFDARC y} — a(t)ne(thor Age, for £> 3, 
scl)’ k scl scl) scl 
Ape = Bp gay laren — C+ DARE Ly} — ane or Ape” 
1 1 
+ galt)ne(“orll (5:0 + sia), for 2>0, (33) 
where 
U-— AS) 4 Ae (scl) 42 AE) (34) 


These equations are known as the Boltzmann hierarchy for the photon phase space 
distribution function. In the last line of Eq. (33) da» denotes the Kronecker delta 
function. For practical numerical calculations, this infinite set of coupled equations 
must be truncated at some lmax- 

In principle, ignoring questions of computational efficiency, we could truncate 
the system of equations in (33) at some large (ax, sufficiently above the maximum 
multipole number of interest today, and integrate the coupled system from some 
sufficiently early initial time to the present time to to find the “scalar” temperature 
and polarization anisotropies today, which would be given by the following integrals 
over plane wave modes: 


AS) (fi, to) = i} PKAE (i, tok), 
(35) 
AS” (A, to) = / PRAD (A, to; k). 


This was the approach used in the COSMICS code,'® one of the first codes pro- 
viding accurate computations of the CMB angular power spectrum. However in 
this approach much computational effort is expended at late times when there is 
virtually no scattering. 

A computationally simpler but mathematically completely equivalent approach 
known as the line of sight formalism was introduced in Ref. 200. The crucial idea is 
to express the anisotropies today (at x = 0 and 7 = 79) in terms of a line of sight 
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integral, so that 


AY’ @)= [ " dn exp[—T(n)] exp[tk(7 — no)] 


x I(-E) (410 + igon + 5Fa(u)tt) Gi ine) (36) 


and 


Ap'(a) = J dnexpl—r(n)]explik(n — 10) ( =) (1~ Pa(u)) (37) 
where IT = ASD +A) AO). We may think of exp|—r(7)](—dr/dn)dn = V(n)dn as 
a measure along the line of sight weighted according to where last scattering takes 
place. We shall call V(7) the visibility function. In Eq. (36), the first term represents 
the nonpolarized contribution at last scatter and the second term represents the 
contribution from the integrated Sachs—Wolfe effect. In Eq. (37) for the polarized 
anisotropy, the only contribution is from last scatter. It is convenient to rewrite the 
above integrals using integration by parts so that all spatial derivatives along the 
line of sight and occurrences of jz are eliminated, and in the process we also omit 
the monopole surface term at the endpoint corresponding to the observer as this 
contribution is not measurable. After this integration by parts, the integrals above 
take the form 


(3) ie (3) 
Ay p(fa) = i: dn expliku(no — 1)|Sp-p((no0 — 18, 0), (38) 
where the scalar source functions are 


Sf = exp[-ria)l(o! +v) +V (Aro 6+ B+ F 


Lye { Ub, 3Il’ , oll 39 
eal ae) tae (39) 
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and the primes denote derivatives with respect to conformal time 7. Now that 
the spatial derivatives have been eliminated, it is no longer necessary to use the 
plane waves exp|ikj(7 — 7)| as the eigenfunctions of the Laplacian operator with 
eigenvalue —k?. It is convenient instead to use a spherical wave expansion with 


je(k(no — 7) 
A= anie(sem —) 89 ((m nau), (40) 


where A=T,P. 

The line of sight formulation results in a significant reduction of the computa- 
tional effort required for computing the high ¢ coefficients for two reasons. First, 
the Boltzmann hierarchy can be truncated at low @, because only moments up to 
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¢ = 2 appear in the integral. Second, when the universe has become transparent, 
there is no longer any contribution from the Thomson scattering. 

We express the initial state for the growing adiabatic mode at some early refer- 
ence time whose exact value must be chosen after the epoch of entropy generation 
following inflation but before any of the relevant modes re-entered the horizon. We 
use the variable ¢(x), whose value is conserved on superhorizon scales, to character- 
ize the primordial perturbations in the adiabatic growing mode. We may expand, 
either using plane wave Fourier modes or a spherical wave expansion, so that 


Coe) = f akc (k) explik x}, (41) 
or 
ioe) +2£ oo 
Co) => YS f adkcem()je(br)¥im (8,9). (42) 
£=0 m=—£79 


We characterize the homogeneous and isotropic Gaussian statistical ensemble 
for ¢(x) by means of a power spectrum P(k), which suffices to completely define 
the statistical process generating ¢(x). We define P(k) according to the following 
expectation values: 


(¢(k)¢(k’)) = (277)?0°(k — k')P(k), (43) 
or equivalently 
(Com (ke) Gorm: (k’)) = 8¢,0/5mnm'5(k — k’YP(h). (44) 
It follows that 
Cyr. [axl ae _ my) 56) 
ate = ak [ands km —m) 3$ (1.8) (B) (45) 
so that 
Cpe = (lam "atm ) = it © dkP(RASM RAIT (A), (46) 
0 
where 
Af") = f° anje( km —0)) 5% (0,4). (a7) 
0 


The above approach was first implemented in the publicly available code CMB- 
FAST? and later in the code CAMB.'*!™ More details about the computational 
issues may be found in these papers. 


™For a download and information on CAMB, see www.camb.info. 
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7.3. Angular diameter distance 


The radius of the last scattering surface expressed in terms of co-moving units today 
(for example in light-years or Mpc) is given by the integral 


1 
da 1 
s=H) / 48 
si a a? /2,a-4 + Qma-? +0 ( ) 


ls 


for a universe whose stress—energy content includes radiation, matter, and a cos- 
mological constant. Here the scale factor is given by aj; = (zis + 1)~+. For other 
equations of state the argument of the square root above is modified accordingly. A 
physical scale d at last scattering is converted to a co-moving scale today x accord- 
ing to « = (zs, + 1)d, and under the assumption of a flat spatial geometry, the 
angle in radians subtended by an arc of length d on the last scattering sphere is 
d= (d/dis). 

Except for the integrated Sachs—Wolfe effect (discussed below in Sec. 7.4), which 
is very hard to measure because the S/N is negligible but for the very first mul- 
tipoles, the projection effect described above is the only way in which late-time 
physics enters into determining the angular power spectrum. Possible spatial cur- 
vature, parameterized through Q,, the details of a possible quintessence field, or 
similar new physics that alter the expansion history at late time enters into the CMB 
power spectra only through a rescaling of the relation between angular scales, on 
the one hand, and physical scales, on the other, around last scattering, and this 
rescaling can be encapsulated into a single variable d),. 

If the family of FLRW solutions is extended to admit spatial manifolds of con- 
stant positive or negative curvature, the expression for the angular size must be 
modified to the following for a negatively curved (hyperbolic) universe 


p= dig od (49) 
sinh[Q}/ dis] Ms 


or in the case of positive curvature (i.e., a spherical universe) 


(—Qx)'/?dis 
d= . 
sin[(—0;)!/2di] dis 89) 


In the former case, the additional prefactor giving the contribution for the non- 
Euclidean character of the geometry has a demagnifying effect and increases the 
angular diameter distance, whereas for the spherical case the effect is the opposite. 

In the early 1990s, many theorists working in early universe cosmology fer- 
vently believed on grounds of simplicity in a cosmological model with Q, = 0 
(supposedly an inexorable prediction of inflation) and no cosmological constant 
(regarded as extremely finely tuned and “unnatural”) containing only matter and 
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radiation so that Q, +Q, = 1 exactly. This belief was maintained despite a num- 
ber of discordant astronomical observations. Many in the astronomical community, 
however, rather adopted the attitude “I believe only what I see,” 
some kind of a low-density universe, either a negatively curved universe where 
Q, = 1-—Q,_, — Q, or a universe with a nonzero cosmological constant, where 
Qa, =1-O,_, - Q,.7 

Although it was widely claimed that spatial flatness was a prediction of inflation, 
the basic idea of how to produce a negatively curved universe within the framework 
of inflation had already been proposed in 1982 by Gott and Statler.’?7? However, 
since they had not calculated the perturbations predicted in such a model, it was 
not clear that such a model would work. In 1995 Bucher et al.?&?7 and Yamamoto 
et al.?°? calculated the perturbations for single bubble open inflation, showing them 
to be consistent with all the observations available at the time. Present constraints 


from the CMB limit |Q, —1| to a few percent at most, the exact number depending 
166,171 


and thus favored 


on the precise parametric model assumed. 


7.4. Integrated Sachs—Wolfe effect 


The discussion above claimed that the impact of late-time physics on the CMB 
power spectrum could be completely encapsulated into a single parameter: the 
angular diameter distance to last scattering d},. This is only approximately true, 
because the geometric argument above assumed that all the CMB anisotropy can 
be localized on the last scattering surface and ignored the integrated Sachs—Wolfe 
term in Eq. (22), which gives the part of the linearized CMB anisotropy imprinted 
after last scattering. 

Intuitively, the integrated Sachs-Wolfe term can be understood as follows. The 
Newtonian gravitational potential blueshifts the CMB photons as they fall into 
potential wells and similarly redshifts them as they again climb out. If the depth 
of the potential well does not change with time, the two effects cancel. In this case, 
there is no integrated Sachs—Wolfe contribution because the integral can be reduced 
to the sum of two surface terms. But if the depth of the potential well changes with 
time, in particular if the overall scale of the potential is decaying, the two effects 
no longer cancel and an integrated Sachs—Wolfe contribution is imprinted. 

In the linear theory, the gravitational potential at late times may be factorized 
in the following way: 


®(x,t) = ®(x)T(t), (51) 


and we may normalize T(t) to one when the universe has become matter dominated, 
but before the cosmological constant — or some other dark energy sector — has 
started to alter the expansion history. 


"For a snapshot of the debates in cosmology in the mid-1990s, see the proceedings of the conference 
Critical dialogues in cosmology held in 1995 at Princeton University.?!% 
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From the CMB anisotropies alone, it is hard to determine the contribution of the 
integrated Sachs—Wolfe effect to the total CMB anisotropy on large angular scales. 
However one can try to isolate its contribution by measuring the cross-correlation 
of the CMB temperature anisotropy with the large-scale structure using a broad 
radial window function extending to large redshifts as was first proposed in Ref. 37, 
whose aim was to find evidence for a nonzero cosmological constant. 

The evolution equation for the linearized density contrast of a single pressureless 
matter component is48 


bm(t) + 2H (t)dm(t) — : 


Hm (Eimlt) =0, (52) 
where the dots denote derivatives with respect to proper time and H(t) = a(t)/a(t). 
Here we treat the baryons and the CDM (cold dark matter) as a single component 
ignoring any relative velocity between them. This is not a bad approximation at 
late times, particularly on very large scales. 

For an Einstein-de Sitter universe (for which Q, is negligible and Qa, = 0), 
which is a good approximation for our universe at intermediate redshifts, because 
for z S Zeq © 3.4 x 10% radiation is unimportant and after the decoupling of 
the photons and baryons, it is valid to set the velocity of sound equal to zero as 
above. Let zm~ = 1/(1+@ma) be the redshift where the nonrelativistic matter and 
the putative cosmological constant contribute equally to the mean density of the 
universe, so that Qm = @ma?/(1+@ma?) and Q, = 1/(1+ ama?). For Qa = 0.68, 
for example, we would have z,,, = 0.3 and ama = 0.78. For the Einstein—de Sitter 
case, a(t) = t?/3 and the growing and decaying solution to Eq. (52) are ¢?/% and 
t~1, respectively. (1/a?)V7¢ = (3/2)H?Qm6m, and for the growing mode ®(x) is 
independent of time, implying that there is no integrated Sachs—Wolfe contribution 
at late times. However, as the cosmological constant begins to take over and Qa 
starts to rise toward one, ® starts to decay, as sketched in Fig. 15. 

As explained in detail in Ref. 37, the maximum signal-to-noise (S/N) that can 
be extracted from the ISW correlation, even with an ideal measurement of the 
large-scale structure anisotropy within our past light cone, is modest because of 
the “noise” from the primary CMB anisotropies emanating from the last scattering 
surface. This cosmic variance noise cuts off the available S/N because on small 
angular scales, the relative contribution from the ISW term plummets. The S/N is 
centered around ¢ = 20 and integrates to about S/N ~ 6.5 (assuming Qo = 0.7). 
For an early detection of a correlation, see Ref. 25, and for the most recent work in 
this area see Ref. 168 and references therein. 


7.5. Retonization 


According to our best current understanding of the ionization history of the uni- 
verse, the early ionization fraction around recombination is accurately modeled 
assuming homogeneity and using atomic rate and radiative transfer equations. As 
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Fig. 15. Decay of the gravitational potential for the late-time integrated Sachs—Wolfe effect. We 
sketch the time dependence of the gravitational potential for a universe with Qa9 = 0.9, 0.8, 0.7, 
and 0.6 (from top to bottom on the right) with a = 1 today. The qualitative behavior will be 
similar for other kinds of dark energy. 


a rough estimate, one may assume the reionization fraction predicted by equilib- 
rium thermodynamics given by the Saha equation, in which there is an exponen- 
$, which occurs at about 
z & 1270, the ionization fraction x(z) rapidly decays to almost zero, except for 
trace amounts of ionized hydrogen (denoted as HII in the astrophysical literature) 
and of free electrons, due to the inefficiency of recombination in the presence of 
trace concentrations. If it were not for inhomogeneities and structure formation, 
this nearly perfect neutral phase would persist to the present day. However, the 
initially small primordial inhomogeneities grow and subsequently evolve nonlin- 
early. The first highly nonlinear, gravitationally collapsed regions give rise to the 
first generation of stars and quasars, which serve as a source of UV radiation that 
reionizes the neutral hydrogen almost completely, except in a minority of regions 
of extremely high gas density, where recombination occurs sufficiently frequently to 
counteract the ionizing flux of UV photons. 

The first observational evidence for such nearly complete reionization came from 
observations of the spectra of quasars at large redshift in the interval 10.4eV/(1 + 
Zquasar) < Ey < 10.4eV — that is, blueward of the redshifted Lyman a line.” If the 
majority of the hydrogen along the line of sight to the quasar were neutral, resonant 
Lya (2p — 1s) scattering would deflect the photons in this frequency range out of 
the line of sight, and this part of the spectrum would be devoid of photons. The 
fact that the spectrum is not completely blocked in this range indicates that the 
universe along the line of sight was reionized. Almost complete absorption by neutral 
hydrogen — that is, the so-called Gunn—Peterson trough — was not observed in 


tial sensitivity to changes in temperature. After x & 
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quasar spectra until 2001, when spectroscopic follow-up of several high-redshift 
quasars discovered by the Sloan Digital Sky Survey (SDSS) detected such a trough 
from z & 5.7 to z © 6.3, where the continuum emission blueward of the Lya appears 
to have been completely absorbed by neutral hydrogen.!4 

The big question here is exactly when did the universe first become reionized. 
Or said another way: When did the first generation of stars and quasars form and 
become capable of generating enough ionizing radiation to convert the neutral gas 
to its present ionized state? 

This is a vast subject of very current research, sometimes described as the explo- 
ration of the “Dark Ages” of the universe. While we believe that we know a lot about 
the conditions around the time when the CMB anisotropies were imprinted, and 
we also know a lot about the recent universe extending to moderate redshift, little 
is known with any degree of certainty concerning this intermediate epoch. Good 
reviews of this subject include Ref. 12. Future observations of the 21 cm hyperfine 
transition of atomic hydrogen (HI) by the next generation of radio telescopes, in par- 
ticular the Square Kilometer Array (SKA),° promise to provide three-dimensional 
maps of the neutral hydrogen in our past light cone. 

Here we limit ourselves to discussing the impact of reionization on the CMB 
anisotropies. The most simplistic model — or caricature — of reionization pos- 
tulates that the ionization fraction x(z) changes instantaneously from x = 0 for 
Z > 2reion to © = 1 for z < Zreion. Under this assumption, the optical depth for 
rescattering by the reionized electrons is 


Zreion dl 
T= [ dzorne(z) 7. (53) 


The Thomson scattering optical depth from redshift z to the present day is given 
by the integral 


aoe cor ‘i da ne(a) 


Ho Jay @ (Qe? Opa +, 
_ COTNeo [ da Xe(a) (54) 
— Hp ate) at (Q,a-4 + Q,,4a73 +4 Q,)1/2 ’ 


where a(z) = 1/(z+1), or is the Thomson scattering cross-section, Ho is the present 
value of the Hubble constant (in units of inverse length), and neo is the electron 
density today, assuming x, = 1. The functions n-(a) and x(a) are the electron 
density? and the ionization fraction at redshift z = 1/a — 1, respectively. 


°https: //www.skatelescope.org/. 
PIt is customary to define re as ne/ng, so that we slightly exceeds one when the helium is 
completely ionized as well. 
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On small angular scales, the effect of reionization on the predicted CMB tem- 
perature and polarization anisotropy spectra is straightforward to calculate. Of 
the CMB photons emanating from the LSS, a fraction exp(—r) does not undergo 
rescattering by the reionized electrons and their fractional temperature perturba- 
tions are preserved. However, for the remaining fraction (1 — exp(—T)) © 7 that 
is rescattered, the small-angle CMB anisotropies are completely erased. The reion- 
ized electrons nearly completely wash out small-scale detail much like a window 
pane of ground glass. The net effect is that the small-scale CMB anisotropies are 
attenuated by a factor of exp(—T) in amplitude and by a factor of exp(—2r) in the 
power spectrum. On larger angular scales, the attenuation of the anisotropy from 
the rescattered photon component is incomplete, and a more detailed calculation is 
needed to determine the detailed form of the ¢-dependent attenuation factor. The 
order of magnitude of the angular scale where the attenuation starts to become 
incomplete is given by the ratio of the co-moving distance travelled from next-to- 
last to last scattering by a rescattered photon dyeion to the radius of the LSS dys. 
We thus obtain an angular scale O;eion © dreion/dys for this transition. 

For determining the cosmological model from the C'pp power spectrum alone, 
the reionization optical depth is highly degenerate with the overall amplitude of the 
cosmological perturbations A. This is because except on very large scales, where cos- 
mic variance introduces substantial uncertainty, it is the combination A exp(—2r) 
that determines the observed amplitude of the CMB angular power spectrum. 

The polarization power spectra, meaning C®” and also CTY, help to break 
this degeneracy by providing a rather model independent measurement of Tyeion- 
Recall our order of magnitude estimate for the polarization introduced by the finite 
thickness of the LSS 
OT 
Bx2’ 
where d and «x are expressed in co-moving units and d is the mean co-moving dis- 
tance that the photon travels from next-to-last scattering to last scattering. The 
second derivative is that of the temperature anisotropy evaluated at the position of 


Px@ (G5) 


a typical electron of last scatter. Let us compare the polarization imprinted during 
recombination to that imprinted later for the fraction of photons (1 — exp(—rT)) & rT 
rescattered by the free electrons from reionization, for the moment assuming 
that the second derivative factors are comparable for the two cases. However, 
drec < dreion; therefore, we may expect that Preion/Prec © T(drec/dreion)? >> 1. 
This polarization from reionization, however, is concentrated almost exclusively in 
the first few multipoles, because directions separated by a small angle see the same 
quadrupole anisotropy from last scattering from the vantage point of a typical free 
electron after reionization. Figure 16 shows the scalar power spectra for four val- 
ues of 7 keeping the other cosmological parameters fixed. We shall see that the 
same amplification applies to the tensor modes, or primordial gravitational waves, 
presumably generated during inflation. 
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Fig. 16. CMB power spectra for four values of the reionization optical depth. We plot the indi- 
cated power spectra for 7 = 0.00, 0.04, 0.08, and 0.12 assuming the same concordance values for 
the other cosmological parameters. We multiply the spectra by exp(27T) in order to remove the 
trivial dependence on 7 at small angular scales and thus highlight what occurs at low £@. 


We now turn to observations allowing us to fix the value of 7 using the low @ 
C¥F spectra and CT spectra based on the qualitative theoretical arguments above 
but made more precise by calculating + within the framework of the previously 
described six-parameter concordance model. While the first observations of the 
polarization of the CMB were reported by the DASI experiment,!!! it was the 
WMAP large-angle polarization results, first the TE correlations reported in the 
one-year release, later followed by the EE correlation results from the third-year 
results, that provided the first determination of Treion using the CMB, and sub- 
sequently refined in later updates to the WMAP results with more integration 
time. As part of their first-year results,?*! the WMAP team reported a value for 
the reionization optical depth of r = 0.17 + 0.04 at 68% confidence level, which 
would correspond to 11 < z, < 30 at 95% confidence where a step function profile 
for the ionization fraction is assumed. This determination was based on exploit- 
ing only the measurement of the TE cross-correlation power spectrum, because 
for this first release the analysis of the polarized sky maps was not sufficiently 
advanced to include a reliable EE auto-correlation power spectrum measurement. 
The reported value of t was larger than expected. In their third-year release,?°" 
when WMAP presented a full analysis of the polarization with EE included, the 
determination of 7 shifted downward, namely to rT = 0.10 + 0.03 (using only EE 
data) and r = 0.09 + 0.03 with (TT, TE and EE all included). The final nine-year 
WMAP release?®! cites a value of tT © 0.089 + 0.014 using WMAP alone. Including 
other external data sets results in small shifts about this value. 
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The Planck 2013 Results for Cosmology'®!:!66:!71 release did not include an 
analysis of the Planck polarization data. Therefore the likelihoods used to determine 
the cosmological parameters included the WMAP determination of 7 as a prior, and 
in some cases the WMAP low-@ polarization likelihood was used in order to break 
the degeneracy between 7 and A described above that arises when only temperature 
data is used. 

It has been pointed out that reionization histories, which are completely char- 
acterized by function 2,.(z), cannot be reduced to a single number. Some work has 
been done to characterize what further information can be extracted from the low-é 
CMB concerning the reionization history, as described in Refs. 95 and 101, but the 
conclusion is that from the CMB data alone at most a few nonzero numbers can 
be extracted at S/N > 1 given the large cosmic variance at low @. 

In the above discussion we have assumed that reionization occurs homogeneously 
in space, an approximation adequate for calculating its effect on the anisotropies at 
large and intermediate angular scales. However, according to our best understand- 
ing, the universe becomes reionized by the formation of bubbles, or Strémgren 
spheres, surrounding the first sources of ionizing radiation, which grow and become 
more numerous, and eventually percolate. While this basic picture is highly plausi- 
ble, the details of modeling exactly how this happens remain speculative. The effect 
on the CMB is significant only at very small angular scales, far into the damping tail 
of the primordial anisotropies. Power from the low-@ anisotropies is transformed into 
power at very high ¢. This can be understood by considering the power in Fourier 
space resulting from the rather sharp bubble walls, which in projection look almost 
like jump discontinuities. Some simulations of this effect, along with other effects 
at large @, can be found for example in Ref. 253. 


7.6. What we have not mentioned 


The preceding subsections provide a sampling of what can be learned from the 
CMB TT and polarization power spectra with each point discussed in some detail. 
For lack of space, we cannot cover all the parameters and extensions of the simple 
six-parameter concordance model that can be explored, and here we provide a few 
words regarding what we have not covered giving some general references for more 
details. 

The structure of the acoustic oscillations and the damping tail (relative heights, 
positions, etc.) of the CMB power spectra depends on the cosmological parameters, 
as forecast and explained pedagogically in Refs. 92, 99, and most recently applied 
to the Planck 2013 data for determining the cosmological parameters.!®° Some 
of the constraints are obtained in the framework of the standard six-parameter 
concordance model, but many extensions to this model can be constrained, such 
as theories with varying @gzpp, nonminimal numbers of neutrinos, varying neu- 
trino masses, isocurvature modes, nonstandard recombination, to name just a few 
examples. Moreover the question can be addressed whether there exists statistically 
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significant evidence in favor of extending the basic six-parameter model to a model 
having additional parameters. See Refs. 166 and 171 for an extensive set of refer- 
ences. 


8. Gravitational Lensing of the CMB 


The previous sections presented a simplified view of the CMB where the CMB 
anisotropies are imprinted on the so-called last scattering surface, situated around 
z & 1100. It is assumed that we look back to this surface along straight line geodesics 
in the unperturbed coordinate system. This is not a bad approximation, but it is 
not the whole story. Inhomogeneities in the distribution of matter, particularly at 
late times when the clustering of matter has become nonlinear, act to curve the 
photon, or geometric optics, trajectories. These nonuniform deflections distort the 
appearance of the surface of last scatter, much like a fun house mirror. Gravitational 
lensing of the CMB is a small effect but large enough so that it has to be taken into 
account in order to compare models of the primordial universe with the observations 
correctly.?°4 In one sense, gravitational lensing may be viewed as a nuisance effect 
to be removed. But gravitational lensing of the CMB may also be regarded as an 
invaluable tool for probing the inhomogeneities of the matter distribution between 
us and the last scattering surface. Gravitational lensing is presently a very active 
area of astronomy, and the CMB is only one of the many types of objects whose 
gravitational lensing may be measured in order to probe the underlying mass inho- 
mogeneities in the universe. Gravitational lensing of the CMB may be contrasted 
with competing weak lensing probes in that: (i) for the CMB, the sources (i.e., 
the objects being lensed) are the most distant possible, situated at z ~ 1100, (ii) 
because of the above, CMB lensing is sensitive to clustering at larger redshifts, and 
thus less sensitive to nonlinear corrections, and (iii) unlike observations of the cor- 
relations of the observed ellipticities of galaxies, where there are systematic errors 
due to intrinsic alignments, for CMB lensing there is no such problem. 

In the linear theory (which is a good approximation, especially for a heuristic 
discussion), the deflection due to lensing may be described by means of a lensing 
potential, defined as a function of position Q on the celestial sphere. Let Ouvione 
coordinatize the last scattering surface as it would look in the absence of lensing, 
and define a to be the position on the last scattering surface with lensing taken 
into account. We may express 


Chews = Oh vstene + V Piens (Q). (56) 
It follows that 
Tiens —= Tunlens = (Viens) . (VTuniens); (57) 


and similar expressions may be derived for the polarization. As in weak lensing, we 
may derive a projected Poisson equation for the lensing potential 


VDiene(0) = [ de W (Ei; Heoinns) oO, 2), (58) 
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where V? is the Laplacian operator on the sphere. More details about the lensing 
of the CMB and what can be learned from it may be found in the review!?* and 
references therein. 

The first discovery of gravitational lensing of the CMB involved detecting a sta- 
tistically significant nonzero cross-correlation between a lensing map reconstructed 
using the WMAP data and infrared galaxies.?°? The use of a cross-correlation allows 
one to detect a small signal within a noisy map. Figure 17 shows the gravitational 
lensing spectrum as measured by Planck.!®” Measurements of gravitational lens- 
ing have also been made by ACT? and SPT,?!* and the Polarbear experiment 
has recently reported the discovery of B modes due to lensing.'8* The lensing field 
described above (which can be expressed either as a potential, as a deflection field, 
or as a dilatation and shear field) can in principle be recovered from distortions in 
the small-scale power spectrum due to the lensing field on larger scales. As a general 
rule, the higher the resolution of the survey, the better the reconstruction. Hu and 
Okamoto!4#:93 proposed a reconstruction based on Fourier modes, and later many 
other estimators have been proposed, which are all quite similar because they are all 
exploiting the same signal. In the future, as observations at higher sensitivity and 
in particular due to B modes at small scales become available, lensing promises to 
become a powerful probe of the clustering of matter and halo structure. It is hoped 
that lensing will eventually be able to determine the absolute neutrino masses, 
even if they are as low as allowed by present neutrino oscillation data.!?° Lensing 
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Fig. 17. Power spectrum of CMB gravitational lensing potential as measured by Planck. The 
angular power spectrum for the lensing potential as reported in the Planck 2013 results is shown 
above. The overall statistical significance for a lensing detection is greater than 250. The solid 
curve indicates the expectation based on the concordance model whose six parameters have been 
fixed by the Planck data combined with ancillary data. For more details, see Ref. 167. (Credit: 
ESA/Planck Collaboration) (For color version, see page II-CP9.) 
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detected through B modes is particularly advantageous because the fact that the 
B modes should be zero in the absence of lensing means that cosmic variance does 
not intervene and that only the instrument noise is an issue.4 


9. CMB Statistics 
9.1. Gaussianity, non-Gaussianity, and all that 


For the simplest models analyzed at linear order, the primordial CMB signal is the 
outcome of an isotropic Gaussian random process on the celestial sphere. While 
any particular realization of this random process produces a sky temperature map 
that is not isotropic, the hypothesis is that the underlying stochastic process is 
isotropic. Concretely the hypotheses of isotropy and Gaussianity imply that the 


probability of obtaining a sky map T(Q), whose multipole expansion in terms of 
spherical harmonic coefficients is given by 


TOSS) y aanKonlO), (59) 


is as follows: 


fore) +2 


1 
PUax,})'= : 
i NO re 2 Cy 


(60) 


Here 2 = (@,@) denotes a position on the celestial sphere and the set of posi- 
tive coefficients eran represents the angular power spectrum of the underlying 
stochastic process, or the theoretical power spectrum. 

If we had postulated Gaussianity alone without the additional hypothesis of 
isotropy, we would instead have obtained a more general probability distribution 
function having the form 


= 1 P = 
P({dem}) = det 1/2 (2nCin) exp = s > Gem” (Crp) * parte) ken! : (61) 


£,m £',m!’ 


Here C;;, is the covariance matrix for the spherical harmonic multipole coefficients. 
This Ansatz is too general to be useful because there are many more parameters 
than observables, and in cosmology we can observe only a single sky — or said 
another way, only a single realization of the stochastic process defined in Eq. (61). 
The assumption of isotropy is very strong, greatly restricting the number of degrees 


4This conclusion hardly depends on r whatever its final value may turn out to be, because the 
angular scales best for probing lensing lie beyond the recombination bump of the B modes. 
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of freedom by setting all the off-diagonal elements to zero and requiring that the 
diagonal elements depend only on @. 

Happily, the present data, subject to some caveats (such as gravitational lensing, 
discussed above in Sec. 8), seem consistent with the hypotheses of isotropy and 
Gaussianity, notwithstanding some “anomalies” at moderate statistical significance 
(discussed in detail below in Sec. 12). These anomalies could be a sign of something 
new, but given their limited statistical significance, the argument that they are 
simply statistical “flukes” cannot be rejected. 

For completeness, we now indicate how the above formalism is extended to 
include the polarization of the primordial CMB signal. There are several formalisms 
for describing the polarization of the CMB, for example, using spin-weighted spher- 
ical harmonics, or using the Stokes parameters I, Q, U, V.?8 (The polarization of 
the CMB and its description are discussed in Refs. 35, 96, and 108.) We may define 
Tap(Q) = (Ea(Q)*Ep(Q)) time where Eq(Q) is the electromagnetic field component 
propagating from the direction Q and units are chosen so that for a unit electric 
polarization vector €, T,p(Q)eaep represents the CMB thermodynamic temperature 
when by means of a linear polarizer only the component along € is measured. We 
may decompose 


Tap(Q) = T(Q)5an + Pra(Q), (62) 


where the polarization tensor (containing the Stokes parameters Q,U, and V) is 
trace-free and Hermitian (and also symmetric in the absence of circular polariza- 
tion). 

We must now choose a convenient basis for the polarization. If we considered 
a single point on the celestial sphere, we could using Stokes parameters set Q = 
T\1 —To2 and U = 2T\»9, but no natural choice for the directions of the orthonormal 
unit vectors é; and é2 in the tangent space at Q can be singled out as preferred. 


Had we instead chosen 
é} cos@ siné\ (/é; 
a. F ~ |? (63) 
€5 —sin@ cos é9 


we would instead have the Stokes parameters 


a _ ( - 20 sin A ) (64) 
U' —sin2@ cos26 U 


The transformation law in Eq. (64) indicates that the traceless part of the polar- 
ization is of spin-2. If the sky were flat with the geometry of R? instead of S? and 
we had to include homogeneous modes as well, there would be no way out of this 
quandary. But there is a celebrated result of differential geometry that a fur on a 
sphere of even dimension cannot be combed without somewhere leaving a bald spot 
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or singularity." A traceless tensor field on the sphere cannot have an everywhere 
vanishing covariant derivative. 

We can generate a complete basis for the polarization on the celestial sphere by 
means of derivative operators acting on the spherical harmonics. Using Yet) as 
a starting point, we may define an E mode basis vector as follows: 


& 1 a 
Eanem(O) = NEP (Was ~ 5500?) Yim (@), (65) 


where NP *B is a normalization factor. Here a,b = 1,2 indicate components with 
respect to an arbitrary orthonormal basis on S$”. Without introducing more struc- 
ture, this is the only way to produce a second-rank traceless tensor on the sphere 
with derivative operators acting on Yem(Q) in a way that does not break isotropy, 
a feature that ensures that Eaptm(Q) transforms under rotations according to the 
quantum numbers £m. However, there is another polarization, rotated with respect 
to the E mode by 45°, and in order to write down a basis for this other polarization, 
known as the B modes, it is necessary to introduce an orientation on the sphere 
through the unit antisymmetric tensor with €;2 = 1. This choice of orientation, or 
volume element in the language of differential geometry, defines a handedness, or 
direction in which the 45° rotation is made. 
We may define 


Bab = (daa’ + €aa’ ) (Opp! =P €pp' ) Ea’, (66) 


Nle 


or in terms of derivative operators 


1 i 
Bab = 5 NP” (CacSba + €bc0ad) VeV dYem(22). (67) 


The difference between E modes and B modes is illustrated in Fig. 18. We want to 
normalize so that 


f 2B 0.0m DE ene) _ [2B arom OBir,em(®) = 60,0 Om,m! (68) 
and 
/ AOE ch ten SD) Bee tier) =O (69) 


so we require that! 


2 
Ny" = (ran 1)e(@— 1)’ mg) 


'This is a consequence of the “hairy ball” theorem.®° 
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Fig. 18. E and B modes of the CMB polarization. The polarization pattern in the left panel can 
be expressed as the double derivative acting on a potential with its trace removed (in this case a 
Gaussian potential) and thus is an E mode. The B mode pattern on the right, however, cannot 
be represented in this way, but can be represented by a pseudo-scalar potential whose double 
derivative with the trace removed followed by a 45° rotation gives the indicated pattern. 


Although the above discussion emphasized the role of spherical harmonics, the 
distinction between F& and B modes does not rely on spherical harmonics, and 
much confusion has resulted from not fully appreciating this point. The absence of 
an E mode component or a B mode component may be formulated in a completely 
local manner using covariant derivative operators acting on the polarization tensor 
Pap (Q). If VaPap = 0, then the £ mode is absent, and similarly if €g,VaPre = 0, 
then the B mode is absent. Consequently, if we are willing to differentiate the data, 
we could detect B modes locally with absolutely no leakage. However, the above 
does not imply that a unique decomposition of P,, into E and B on an incomplete 
sky is possible because nonsingular vector fields (akin to harmonic functions) can 
be found that satisfy both conditions, although no nontrivial such vector fields 
exist defined on the full sky. Where global topology does enter is in eliminating 
configurations that satisfy both the above differential constraints. 

Having defined the EF and B modes, we now indicate how the statistical dis- 
tribution defined in Eq. (60) is modified to include polarization. We modify our 
notation somewhat to avoid unnecessary clutter. When polarization is included, 
our vector {az,,} is generalized to become {aj,,,,af,,,,a¢,,}, Which we shall denote 
as the vector (t,e,b). For each ¢ we have a positive definite covariance matrix of 
the form 


ie TE TB 
Ce Ce Ce 


C= | CP Cr cr i. (71) 
CP ere: a 


If we assume that the underlying physics imprinting the cosmological pertur- 
bations is invariant under spatial inversion, then the above covariance matrix 
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simplifies to 


C=|C" ce 6 I; (72) 


because under spatial inversion 
a = =F", cre = =Or” (73) 


whereas the other power spectra coefficients preserve their sign. 
If parity is a symmetry of the physical processes generating the primordial per- 


turbations and imprinting the CMB anisotropies, the expectation values ore and 


Cr?” must vanish, although for any particular sky realization C; *°?* and C/?:°Ps 
will not vanish because of cosmic variance. (Recall that on™ and OF Bth is the 
average over a fictitious ensemble including an infinite number of sky realizations.) 
Studying Cor and OF B.obs to search for statistically significant deviations from 
zero is a way to search for parity violation in the very early universe. However, 
in practice adjusting experimental parameters to minimize C;?°?* and C/?°”* is 
often used to calibrate detector angles in B mode polarization experiments. 

The propagation of light, or electromagnetic radiation, at least historically and 
conceptually, occupies a privileged place in the theories of special and general rela- 
tivity, because of the constancy of the speed of light in vacuum and the fact that at 
least in the eikonal approximation, light travels along null geodesics, with its elec- 
tric polarization vector propagated along by parallel transport. It is by thinking of 
bundles of light rays that we construct for ourselves a physical picture of the causal 
structure of spacetime. A modification of the propagation of light (for example, from 
dilaton and axion couplings,'4! a possible birefringence of spacetime,??6?:8?:2° or 
alternative theories of electromagnetism™:’!-14) would not necessarily undermine 
the foundations of relativity theory, but would probably merely lead to a more 
complicated theory, possibly only in the electromagnetic sector. The CMB may be 
considered an extreme environment because of the exceptionally long travel time of 
the photons, and one can search for new effects beyond the known interactions with 
matter (e.g., Thomson and other scattering, collective plasma effects such as disper- 
sion and Faraday rotation). Some such effects would lead to spectral distortions, for 
example through missing photons. The rotation of the polarization vector during 
this long journey would lead to mixing of E and B modes, and since the primor- 
dial E modes are so much larger, leads to interesting constraints on birefringence, 
which will greatly improve as more data comes in from searches for primordial 
B modes. 

The statistical description in Eqs. (71) and (72) suffices for confronting the pre- 
dictions of theory with idealized measurements of the microwave sky with complete 
sky coverage and no instrument noise. We must further assume that there are no 
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secondary anisotropies, nor galactic foregrounds, both of which have been perfectly 
removed. 

Even under these idealized assumptions, it is not possible to pin down the theory 
completely because of a phenomenon known as cosmic variance. The problem is that 
while we would like to characterize the covariance of the underlying theory Ge 
(where A, B = T,E, B), we have only a single realization of the microwave sky, and 
Cr is only an estimate of aa with fluctuations about its expectation value, 
which is equal to cr. If we consider just the temperature fluctuations, we may 
define the sample variance for the @th multipole 


+6 
1 
CTT obs _ ml? 74 
‘ Eri) Da len wet 
The random variable Con obeys a x?-distribution with (2¢+ 1) degrees of free- 


dom and thus has fractional variance of \/2/(2€+ 1). If we consider a microwave 
sky map bandlimited up to €max, there are approximately a degrees of free- 
dom, so for example the overall amplitude of the cosmological perturbations may 
be determined with a fractional accuracy of approximately 1/max. 

Most analyses of the CMB data (possibly combined with other data sets) assume 
a model where the theoretical power spectrum depends on a number of cosmological 
parameters, which we may abstractly denote as a1, Q@2,...,@p, also written more 
compactly as the vector a. For example, in the six-parameter minimal cosmological 
model claimed sufficient in the 2013 Planck Analysis!® to explain the present data, 
the parameters comprising this vector were Ag,ns, Ho, w»p,wWcpMm, and T. 

Whether one adopts a frequentist or Bayesian analysis to analyze the data, the 
input is always the relative likelihood of the competing models given the observa- 
tions, or a ratio of the form: 


P(observations | a) 


P(observations | a2)’ 72) 
In this ratio, the measure for the probability density of the outcome, represented 
by a continuous variable, cancels. 

In this abridged discussion, we shall follow a Bayesian analysis where a prior 
probability Psrior(@) is assumed on the space of models. In Bayesian statistics, 
Bayes’ theorem is used to tell us how we should rationally update our prior beliefs 
(or prejudices) in light of the data or observations to yield a posterior probability 
to characterize our updated beliefs given by the formula 


_ P(observations | a) Porior(Q) 
fda! P(observations | a’) Porior(@’) 


Ppoosterior (@) (76) 


There is no reason why the posterior distribution should lend itself to a simple 
analytic form, and a common procedure is to explore the form of the posterior 
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distribution using Markov Chain Monte Carlo (MCMC) methods,°”!2? which are 
particularly well suited to explore distributions of large dimensionality. 

However, it often occurs that the likelihood and the posterior distribution can be 
adequately represented as a Gaussian by expanding about its maximum likelihood 
value. In this case, the maximum of the log of the probability can be found using 
a numerical optimization routine, and then by computing second derivatives using 
numerical finite differences, the Gaussian approximation to the posterior can be 
found. This infinitesimal analysis is often referred to as a Fisher analysis. 


9.2. Non-Gaussian alternatives 


It is incredibly difficult to test the Gaussianity of the primordial microwave sky 
without some guidance from theory as to what non-Gaussian alternatives are well 
motivated. The space of non-Gaussian stochastic models is dauntingly vast, with the 
space of Gaussian models occupying by any measure an almost negligible fraction 
of the model space. If one tests enough models, one is guaranteed to produce a 
result beyond any given threshold of significance. Consequently, one of the questions 
lurking behind any claim of a detection of non-Gaussianity is how many similar 
models did one test. 


10. Bispectral Non-Gaussianity 


Although it was recognized that inflation, in its simplest form described by Einstein 
gravity and a scalar field minimally coupled to gravity, would have some nonlinear 
corrections, early analyses of the cosmological perturbations generated by inflation 
linearized about a homogeneous solution and calculated the perturbations in the 
framework of a linearized (free) field theory. According to the lore at the time, the 
nonlinear corrections to this approximation would be too small to be observed. The 
first calculations of the leading nonlinear corrections were given by Maldacena!?® 
and Aquaviva et al.,” who calculated the bispectrum or three-point correlations 
of the primordial cosmological perturbations within the framework of single-field 
inflation. 

Subsequent work indicated that in models of multi-field inflation, bispectral non- 
Gaussianity of an amplitude much larger than for the minimal single-field inflation- 
ary models can be obtained. The predictions of many of these multi-field models 
were within the range that would be detectable by the Planck satellite. For many of 
these models, the non-Gaussianity, owing to the fact that it is generated by dynam- 
ics on superhorizon scales where derivative terms are unimportant, is well described 
by the “local” Ansatz for bispectral non-Gaussianity, under which the non-Gaussian 
field Cyz is generated from an underlying Gaussian field ¢; according to the rule 


Cwo(x) = Cr (x) + [6r(x)]’, (77) 
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where fyz is a dimensionless parameter that quantifies the degree of non- 
Gaussianity. 

Here fy z is independent of wavenumber, but a dependence on wavenumber can 
also be contemplated, so that the above relation is generalized to the following (now 
working in wavenumber space): 


¢nz(k) = [tie fee f(s kash) 5° — I — ka)¢r(ki)Cr (ke). (78) 


Here as a consequence of the hypothesis of isotropy, we have expressed fyz as a 
function depending only on the lengths of the vectors. Any two triangles (where 
the three vectors close) that may be mapped into each other by an isometry are 
assigned the same fz. 

For bispectral non-Gaussianity of the local form, as defined in Eq. (77), we 
briefly sketch how the order of magnitude of (S/N)? for a detection of fz from 
a CMB map extending up to max may be estimated. We simplify the estimate by 
employing the flat sky approximation, allowing us to replace the unintuitive discrete 
sums and Wigner 37 symbols with the more transparent continuum integrals and a 
two-dimensional 6-function. Approximating Cy ~ Cof~? (in other words, we ignore 
acoustic oscillations, damping tails, and all that, and we assume that the CMB is 
scale invariant on the sky, as would occur if one could extrapolate the simplified 
Sachs—Wolfe formula (AT) /T = —®/3 to arbitrarily small scales), we may write 


(&) = 000 f cae | cae | Tepe tee) 


- five?Co* (br 72ba 7 + bo 72037? + b3~ 76, ~7)? 
Cae “kK 


= O(1)(fvz)’Colmax? In (=). (79) 


min 


The quantity in the first line simply approximates the sum over distinguishable 
triangles, the combinatorial factors being absorbed in the O(1) factor, and the 
second line gives the signal-to-noise squared of an individual triangle, as calculated 
diagrammatically. More exact calculations taking into account more details confirm 
this order of magnitude estimate. Numerical calculations are needed to determine 
the O(1) factor and deal with €;1ax more carefully. Nevertheless this rough estimate 
reveals several characteristics of local non-Gaussianity, namely: (i) the presence 
of the In(¢max/fmin) factor indicates that the bulk of the signal arises from the 
coupling of large angle modes and small angle modes (i.e., the modulation of the 
smallest scale power near the resolution limit of the survey by the very large angle 
modes (i.e., & = 2,3,...)), and (ii) the cosmic variance of the estimator derives 
from the inability to measure the small-scale power accurately, and thus decreases 
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in proportion to the effective number of resolution elements of the survey. Cosmic 
variance on large scales is not an issue. As long as we know the particular realization 
on large scales accurately (which is not subject to cosmic variance), we know what 
kind of modulation to look for in the small-scale anisotropies. 

Theoretical models for other shapes for the bispectral anisotropy can be moti- 
vated by models for fundamental physics (see for example Refs. 9 and 107 and 
references therein) and have been searched for as well. The Planck 2013 Results gave 
the following limits!”: floc! = 2.745.8, fet" = —42 +75, and forthe = —25 + 39. 
This result rules out many of the models developed to explain a result at low statis- 
tical significance from some analyses of the WMAP data and was a profound disap- 
pointment to those who hoped that Planck would turn up striking evidence against 
the simplest inflationary models. One may anticipate that the results from Planck 
2014 will improve modestly on the 2013 results by including more modes because 
of the use of polarization. If there is a next-generation all-sky CMB polarization 
satellite having good angular resolution so that S/N 1 maps may be obtained up 
to max © 3000, an improvement on the limits on fyy by a factor of a few may 
be envisaged. However, beyond @ = 3000, the primary CMB anisotropies become 
a sideshow. As one pushes upward in @, other nonprimordial sources of anisotropy 
take over, such as gravitational lensing, the Sunyaev—Zeldovich effect, and point 
sources of various sorts. These contaminants have angular power spectra that rise 
steeply with @ while the CMB damping tail is falling rapidly, in fact almost expo- 
nentially. For including more modes, polarization may be helpful. On the one hand, 


polarization measurements present formidable instrumental challenges because of 
their lower amplitude compared to the temperature anisotropies. However, for the 
polarization, the ratio of foregrounds to the primordial CMB signal is favorable up 
to higher @ than for temperature. For searching for non-Gaussianity, the primordial 
CMB is highly linear, unlike other tracers of the primordial cosmological pertur- 
bations. However given that the basic observables are two-dimensional maps, the 
small number of modes compared to three-dimensional tracers of the primordial 
perturbations is a handicap. 


11. B Modes: A New Probe of Inflation 


As discussed above, some limits on the possible contribution from tensor modes, 
or primordial gravitational waves, can be obtained from analyzing the C7? power 
spectrum. However, given that r is not large, these limits become highly model 
dependent, and because of cosmic variance as well as uncertainty as to the correct 
parametric model, cannot be improved to any substantial extent. This difficulty is 
illustrated by the results from the Planck 2013 analysis,!”! which did not include 
the data on polarized anisotropies collected by Planck but only the TT power spec- 
trum. If one assumes the minimal six-parameter model plus r added as an extra 
parameter, a limit of r < 0.11 (95% C.L.) is obtained. However, if the primordial 
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power spectrum in this model (where a form P(k) ~ k":~! is assumed) is gener- 
alized to P(k) ~ (k/k*)"*—1 exp(a(In(k/k*))?), the limits on r loosen by about a 
factor of two to become r < 0.2 (95% confidence level). This lack of robustness to 
the assumptions of the parametric model highlights the fragility of this approach 
and demonstrates that the statistical error bars are not to be trusted unless one 
has strong reason to trust the underlying parametric model. However, given our 
current understanding of inflationary potentials and dynamics, we have no reason 
to trust any of the parametric models, which serve more as fitting functions for 
summarizing the current state of the observations. 

Searching for tensor modes using B modes, on the other hand, allows us to 
probe much lower values of r in a manner that is substantially model independent 
because in the linear theory “scalar” perturbations cannot generate any B modes, 
and this conclusion is independent of the parametric model assumed. 


11.1. Suborbital searches for primordial B modes 


In March 2014, the BICEP2 Collaboration!”’!® claimed a detection of primordial 
gravitational waves based primarily on observations at a single frequency (150 GHz) 
over a small patch of the sky (1% of the sky) that was believed to be of particularly 
low polarized dust emission. Using a telescope based at the South Pole, the BICEP2 
team created a map at 150GHz of the polarized sky emission from which the B 
mode contribution was extracted. The BICEP2 analysis included many null tests 
to estimate and exclude systematic errors as well as to estimate the statistical 
noise in the measurement. However, in order to claim a detection of primordial 
gravitational waves, the BICEP2 team had to exclude a nonprimordial origin for 
the observed B mode signal, for which the main suspect would be polarized thermal 
dust emission. An independent analysis called into question the BICEP2 estimate 
of the likely dust contribution in their field, claiming that all the observed signal 
could be attributed to dust. A subsequent Planck analysis estimating the dust 
contribution in the BICEP2 field by extrapolating polarized dust maps at higher 
frequencies where dust is dominant down to 150GHz confirmed this finding.'°® 
While this analysis does not necessarily exclude a nonzero primordial contribution 
to the B modes observed by BICEP2 at 150GHz, the argument crucial to the 
BICEP2 claim that a dominant contribution from dust can be excluded collapses 
in light of this new finding from Planck concerning the expected contribution from 
thermal dust emission. A cross-correlation study is now underway as a joint project 
of the two teams that will combine the BICEP2 map at 150 GHz with the Planck 
polarization maps at higher frequencies (which are dominated by dust). It remains 
to be seen whether this effort can establish a limit on r better than what is currently 
available or whether this combined analysis might even result in a detection of 
primordial B modes albeit at a lower level of r. 

In any case, a number of experiments are now underway involving observations 
at multiple frequencies that will push down the limits on r using B modes if not 


TI-96 M. Bucher 


result in a first detection if r is not too small. The BICEP2 team has a series of 
upgrades to their experiment expanding their frequency coverage as well as mas- 
sively increasing the number of detectors and hence their sensitivity. Other compet- 
ing efforts include POLARBEAR,® SPIDER*™ and QUBIC," as well as ACTPol 
and SPTPol. 

Moreover, farther into the future, a more ambitious upgrade to ground based 
efforts is contemplated through the US Stage 4 (S4)''? CMB experiment currently 
under consideration by the US DOE. S4 contemplates deploying approximately a 
total of 2x 10° detectors from the ground in order to achieve a massive improvement 
in raw sensitivity. Although it is claimed that an improvement in the control of 
systematic errors to a comparable level can be achieved, it remains to be seen 
whether S4 can realize its projected performance. 


11.2. Space based searches for primordial B modes 


Several groups around the world have proposed space missions specifically dedicated 
to mapping the microwave polarization over the full sky with a sensitivity that 
would permit the near ultimate measurement of the CMB B modes. In Europe, three 
ESA missions had been proposed: BPol in 2007,47 COrE?? in 2010, both of which 
were medium-class, and the large-class mission PRISM!®° in 2013. None of these 
missions was selected. A mission proposal called COrE+ is currently in the process 
of being submitted. In Japan a mission called LiteBird®! has been developed, but 
final approval is still pending. In the US, CMBPol/EPIC*! has proposed a number 
of options for a dedicated CMB polarization space mission,® but none of these has 
been successful in securing funding. All the above concepts involve a focal plane 
including several thousand single-mode detectors in order to achieve the required 
sensitivity. Another proposal endeavors to achieve the needed sensitivity by another 
105 »roposal, a Martin-Pupplet Fourier 
spectrometer comparing the sky signal to that of an artificial blackbody is proposed. 
The setup is similar to the COBE FIRAS instrument, except that polarization 
sensitive bolometers and a more modern technology are used, which will allow an 
improvement in the measurement of the absolute spectrum about two orders of 
magnitude better than FIRAS. 


means: multi-moded detectors. In the Pixie 


12. CMB Anomalies 


The story recounted so far has emphasized the agreement of the observations accu- 
mulated over the years with the six-parameter so-called “concordance” model. But 
this would not be a fair and accurate account without reporting a few wrinkles 
to this remarkable success story. These wrinkles may either be taken as hints of 
new physics, or discounted as statistical flukes, perhaps using the pejorative term 


SSee http://cmbpol.uchicago.edu/papers.php for a list of papers on this effort. 
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a posteriori statistics. On the one hand, one can argue that if one looks at enough 
models that are in some sense independent, one is bound to turn up something 
at high statistical significance, and by most standards the significance of these 
anomalies is not high. Which interpretation is preferred is presently under debate. 
Ultimately, despite all the fancy statistical terminology used in this discourse, what 
one concludes inevitably relies on a subjective judgment of theoretical plausibility. 

One anomaly explored over the years, known under several names including 
“hemispherical asymmetry,” “dipolar modulation,” and “bipolar disorder,” asks 
the question whether the local angular power spectrum is identical when compared 
between opposite directions on the celestial sphere. There is no unique way to 
pose this question precisely. The statistical significance, amplitude, and direction 
of the asymmetry depend somewhat on the precise formulation chosen. However, 
the WMAP finding of a dipolar modulation with an amplitude of 7% and a signifi- 
cance ranging within about 2—4o, and pointing approximately toward (1, b) = (237°, 
20°) in galactic coordinates is broadly confirmed by the Planck 2013 Results. This 
confirmation by Planck, owing to the additional high frequency coverage, renders 
less plausible a nonprimordial explanation based on foreground residuals. The sen- 
sitivity to dipolar modulation on these angular scales is primarily limited by cosmic 
variance for both WMAP and Planck, so future experiments cannot hope to improve 
substantially on the significance of these results. Planck, however, because of its 
superior angular resolution, was able to probe for dipolar modulation pushing to 
smaller scales, where cosmic variance is less of a problem and a possible mod- 
ulation can be constrained more tightly. [The measurement uncertainty in the 
scale-dependent dipole modulation of the amplitude of the perturbations AA(é) 
(assuming broad binning) scales as 1/¢ as long as cosmic variance is the limit- 
ing factor.] The Planck 2013 data find no evidence of dipolar modulation at the 
same amplitude extending to small angular scales, suggesting that if the dipolar 
modulation observed by WMAP is not a statistical fluke, a scale-dependent theo- 
retical mechanism for dipolar modulation is needed. Moreover, Planck does not see 
evidence of modulation associated with higher multipoles (e.g., quadrupolar and 
higher order disorders). This last point is important for constraining theoretical 
models producing statistical anisotropy by means of extra fields disordered during 
inflation. 

Another anomaly is the so-called “cold spot.” Rather than limiting ourselves 
to two-point statistics, or to three-point statistics, searching for a bispectral signal 
using theoretically motivated templates, as described in Sec. 10, we may ask whether 
the most extreme values of an appropriately filtered pure CMB map lie within the 
range that may be expected assuming an isotropic Gaussian stochastic process, or 
whether their p-values (i.e., probability to exceed) render the Gaussian explanation 
implausible. There are many ways to formulate questions of this sort, a fact that 
renders the assessment of statistical significance difficult. One approach that has 
been applied to the data is to filter the sky maps with a spherical Mexican hat 
wavelet (SMHW) filter, which on the sphere would have the profile of the Laplacian 
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operator applied to a Gaussian kernel of a given width o, giving a broad |i.e., 
(Aé)/€ = O(1)] two-dimensional spatial bandpass filter. (See Ref. 215 for more 
details concerning the methodology and Ref. 216 for the original claim of a cold 
spot detection. See also Ref. 248 for an assessment by the WMAP team as well as 
Ref. 172 for the analysis from Planck 2013.) 

The final anomaly concerns possible alignments of the low-¢ multipoles, 
imaginatively named the “axis of evil” by Land and Magueijo.!'+ A Gaussian 
isotropic theory for the temperature anisotropy’ predicts that the multipole vectors 
ay = A pel , are each Gaussian and independently distributed. If we add three 
or more such vectors together according to the tensor product of representations of 


$0(3) 
[1 @L2@:---@LIny, (80) 


which may be decomposed into a direct sum of irreducible representations, provided 
that the triangle inequality is satisfied, we may extract one or more scalars (trans- 
forming according to L = 0) from the above tensor product. The mathematics is 
simply that of the usual addition of angular momentum. For each of these scalars, 
we may ask whether the observed value lies within the range expected from cosmic 
variance in the framework of a Gaussian theory taking into account the uncertainty 
in the determination of the underlying power spectrum. The approach just described 
was not the approach of Land and Magueijo, who maximized max,,¢[_-¢,+¢ |@¢m()| 
as a function of n where ag,,(n) is the expansion coefficient in the coordinate sys- 
tem with n pointing to the north pole. In this way, for each generic value of the 


multipole number @ a unique double headed vector, or axis, +n may be extracted, 
and the alignments between these axes may be assessed. Land and Magueijo report 
an alignment of the ¢ = 3, £ = 4, and @ = 5 axes with a p-value less than 107? in 
the Gaussian theory. Accurate p values may be obtained for each of these invariant 
quantities by resorting to MC simulations to account for the practicalities of dealing 
with a cut sky, etc. However, there still remains a subjective element to assessing 
statistical significance. One may ask how many invariants were tried before arriving 
at a reportable statistically significant anomaly. 


13. Sunyaev—Zeldovich Effects 


In the simplified discussion of reionization presented in Sec. 7.5, it was assumed that 
the electrons responsible for the rescattering are at rest with respect to the cosmic 
rest frame. This approximation treats the reionized gas as a cold plasma having 
a vanishing peculiar velocity field. As pointed out by Sunyaev and Zeldovich,?7% 
corrections to this approximation arise in two ways: (i) In the so-called thermal 


Sunyaev—Zeldovich effect (tSZ), the random thermal motions of the free electrons 


tThe discussion can readily be generalized to polarization, but we shall stick to the temperature 
anisotropy alone in our discussion. 
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Fig. 19. Frequency dependence of thermal Sunyaev—Zeldovich spectral distortions. The solid 
and dashed curves show the fraction spectral distortion in terms of the CMB thermodynamic 
and R-J temperatures, respectively. For low frequencies (v < 217 GHz), there is a decrement in 
temperature, corresponding to the fact that low frequency photons are on the average Doppler 
shifted to higher frequencies by the hot electrons. On the other hand, for v > 217 GHz the net 
effect is to increase the photon phase space density. 


alter the spectrum of the rescattered CMB photons through the Doppler effect and 
Compton recoil. Because (v) = 0, this effect is second order in the velocity, or linear 
in the electron temperature, and results in a spectral distortion not respecting the 
frequency dependence of a blackbody spectrum with a perturbed temperature. (See 
Fig. 19.) (ii) In the kinetic Sunyaev—Zeldovich (kSZ) effect the peculiar velocity of 
the gas results in a shift in temperature proportional to the component of the 
peculiar velocity along the line of sight. The kSZ effect has the same frequency 
dependence as the underlying primary CMB perturbations, thus making it hard 
to detect given its small magnitude. There is also a much smaller effect where the 
transverse peculiar velocity imparts a linear polarization to the scattered photons 
for which the polarization tensor P;; is proportional to (vi ®v1 — su .76,), where 
v_ is the peculiar velocity perpendicular to the line of sight and 6, is the Kronecker 
delta function in the plane perpendicular to the line of sight. 

Galaxy clusters are filled with hot gas [T,,,; = O(10 KeV)] that emits primarily 
at X-ray frequencies. The hot electrons of this fully ionized gas, or plasma, scatter 
CMB photons by Thomson, or Compton, scattering, shifting their frequency by a 
factor of approximately (1 + Gcos@) where 3 = kgT/m-c? and we have ignored 
higher order corrections in 3. The y-distortion parameter along a line of sight is 
given by the integral 


be Ls 
y= far = f torn, 2, (81) 
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and the fractional perturbation in the CMB thermodynamic temperature in the 
nonrelativistic approximation is given by 


Temp, 2) _ (e +1) 


e 


TomB er 


1) vO), (82) 


where « = hv/kpTomp = v/(57 GHz) and y(Q) is the y-distortion map as defined 
by the line of sight integral in Eq. (81). 

On the one hand, the thermal Sunyaev—Zeldovich effect provides a powerful 
probe of the dynamics of galaxy clusters, which can be used to discover new clusters 
and to probe the structure of known clusters away from their central core. On the 
other hand, for observing the primordial CMB, the thermal Sunyaev—Zeldovich 
effect constitutes a contaminant that must either be removed or modelled. The 
review by Birkinshaw!® recounts the early history of tSZ measurements. See also 
the reviews by Carlstrom et al.?? and by Rephaeli.’®! For the observation of the 
kSZ, see Ref. 79. For more recent measurements, see the SZ survey papers from the 
ACT,1!2° SPT,??3 and Planck!®:!79 collaborations. See Ref. 185 for a discussion of 
what might ultimately be possible from space in the future. 


14. Experimental Aspects of CMB Observations 


Like astronomical observations at other wavelengths, most modern microwave 
experiments consist of a telescope and a number of detectors situated at its focal 
plane. The COBE experiment was an exception, as it used single-moded feed horns 
pointed directly at the sky to define its beam, which was relatively broad (7° 
FWHM). However for measurements at higher angular resolution, it is difficult 
to construct a feed horn forming a sufficiently narrow beam on the sky without the 
help of intermediate optics, unless one resorts to interferometry as for example in 
the DASI experiment. 

Observations of the sky at microwave frequencies present a number of challenges 
unique to this frequency range. Unlike at other frequencies, the microwave sky is 
remarkably isotropic. The CMB monopole moment, and to a much lesser extent 
the dipole moment, dominates. In the bands where other contaminants contribute 
least—that is, roughly in the range 70-150 GHz—-virtually all the photons collected 
result from the isotropic 2.725 K background. The brightest feature superimposed 
on this uniform background is the CMB dipole, resulting from our proper motion 
with respect to the cosmic rest frame, at the level of 0.1%. When this dipole com- 
ponent is removed, the dominant residual is the primordial CMB anisotropy, whose 
amplitude on large angular scales is about 35K, or roughly 10~°. This situation 
is to be contrasted for example with X-ray astronomy where only some 50 photons 
need to be collected to make an adequate galaxy cluster detection. For the CMB 
about 10'° photons must be collected in each pixel just to obtain the temperature 
anisotropy at a signal-to-noise ratio of one, and many more photons are needed for 
measuring polarization at the same marginal signal-to-noise. The challenge is to 
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measure minute differences in temperature between different points on the celes- 
tial sphere without introducing spurious effects. These can arise from a variety of 
sources: drifts in the zero point of the detector, detector noise, hot objects in the 
far sidelobes of the beam, and diffraction around the edges of the mirrors, to name 
some of the most common problems. 

For this reason, almost all measurements of the microwave sky are in some 
sense differential. Drifts in the zero point of the detector (known as 1/f noise) 
can be mitigated by rapidly switching either between the sky and an artificial cold 
load with a long time constant or between different parts of the sky. This can 
be accomplished by moving the beam rapidly across the sky, so that in one way 
or the other the basic observable becomes differences in intensity of pixels whose 
measurements are closely separated in time. The point to take away is that one 
absolutely avoids making direct rather than differential measurements of the sky 
temperature. 

Another specificity of astronomical observations at microwave frequencies is that 
extremely stringent requirements must be imposed limiting the magnitude of the 
far sidelobes of the beam. Given the smallness of the differences in temperature of 
interest, contamination from the ground, the earth, the sun, and moon, and also 
from the galaxy near the galactic plane, can easily introduce spurious signals if the 
beam does not fall sufficiently rapidly away from its central peak. Ground pickup 
(from a spherical angle of 27) is particularly challenging to shield, and this is why 
the WMAP and Planck satellites were situated at L2 (the second Lagrange point, 
where the earth and the moon have a tiny angular diameter) rather than in a low- 
earth orbit, which would equally well avoid atmospheric emission, which is the other 
major motivation for going to space. The second earth-sun Lagrange point (L2) is 
1.5x 10°km from the earth — that is, about 4 times the earth—moon distance. From 
L2 the earth subtends an angle of only 16 arcminutes. More importantly from L2, 
the sun, earth, and moon all appear at approximately the same position in the sky 
making it easier to keep away from the far sidelobes of the beam. 

A formidable challenge for suborbital observations is emission from the atmo- 
sphere, primarily but not exclusively from water vapor. The emission from the 
atmosphere has a complicated frequency dependence, as shown in Fig. 20, where 
the optical depth as a function of frequency is shown. This optical depth must be 
multiplied by the atmospheric temperature (i.e., & 270 K) to obtain the sky bright- 
ness. The structure in frequency of the regions of high opacity implies that bands 
must be carefully chosen to avoid emission lines and bands, giving less flexibility for 
wide frequency coverage than from space. The problems arising from atmospheric 
emission can be mitigated to some extent by choosing a site on the ground with 
a particularly small water column density, such as the South Pole or the Atacama 
Desert in Chile, or by observing from a stratospheric balloon. Atmospheric emis- 
sion is problematic in two respects. First, if the atmospheric emission were stable 
in time, it would merely introduce additional thermal loading on the detectors, 
which could be mitigated simply by deploying more detectors or by observing over 
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Fig. 20. Atmospheric optical depth as a function of frequency for 1mm of precipitable water 
vapor. Water vapor is an important but not the only source of atmospheric contamination. See 
Ref. 145 for more details. A column density of 0.5-1mm of precipitable water vapor corresponds 
to good conditions in the Atacama Desert. Conditions at the South Pole are often better by a 
factor of a few. The black curve shows the opacity, or optical depth, and the red curve shows the 
opacity multiplied by 1/20 in order to highlight the structure around the resonances. The limited 
transmission 0 < t < 1 [where t = exp(—rT)], on the one hand, attenuates the desired sky signal. 
But more serious is the superimposed additive noise, having a brightness temperature (1—t)Tatm. 
Here Tatm is the effective temperature of the atmosphere. (Credit: J. Cernicharo) 


a longer time. If the contribution from the sky is T,,y, then the fluctuations in inten- 
sity at the detector (assumed perfect for the moment) would increase by a factor 
of (Tsxy + Tomp)/Tomp, meaning that we could measure the sky temperature with 
the same error by increasing either the number of detectors or the total observation 
time by a factor of ((Tiky + Tomp)/Tomp)*?. At 150GHz from the South Pole, for 
example, to choose a band where the sky brightness temperature is not very high, 
one has Tixyy © 16K. This means that under the idealized assumptions made here, 
almost 40 times more detectors would be required than for a space-based experi- 
ment. But the situation is worse than this, because the atmosphere contamination 
is not stable in time but rather fluctuates in a complicated way characterized by 
a wide range of time scales (owing to the underlying turbulence). Moreover, the 
atmospheric load varies with zenith angle, roughly according to a secant law. Of 
course, observations from the ground are attractive because of their low cost, the 
fact that telescopes can be improved from season to season to address problems 
as they arise, and larger telescopes with many more detectors can be deployed. As 
experiments increase in sensitivity, requirements for mitigating these problems will 
become more stringent. At present it is not known what is the limit on the quality 
of measurements that can be achieved from the ground or from balloons. 


14.1. Intrinsic photon counting noise: Ideal detector behavior 


In radio astronomy (where the frequency of observations is typically lower than 
for CMB observations) the following expression, known as the radio astronomers’ 
equation,4°:!!° gives the fractional error of an observation under slightly idealized 
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assumptions: 


I [Fa + Fs . 1 (83) 
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Here tops is the time of observation of a pixel and (AB) is the bandwidth. Ty, is the 
sky brightness temperature averaged over the pixel and Tiy, represents the addi- 
tional noise introduced within the receiver. This expression can be understood by 
considering the source in the sky as having thermal statistics — that is, fluctuating 
in intensity as a Gaussian random field. In radio astronomy, detection is coherent 
and each detector picks up only a single transverse mode of the electromagnetic 
field incident from the sky, which is converted by an antenna or a feed horn into an 
electrical signal, which depends only on time. This signal is coherently amplified and 
nowadays digitized. Mathematically, in the absence of additional detector noise, the 
sky signal can be thought of as a signal whose complex amplitude fluctuates accord- 
ing to a Gaussian distribution with a variance proportional to T3,y. The coherence 
time is 1/(AB), so in a time interval tops, Nsamp = 2(AB)tops independent real- 
izations of this Gaussian random process whose variance we are trying to measure 
are collected, leading to a fractional error in the determination of the variance, or 
Tsky, of 1/,/Nsamp- For a more realistic measurement noise is also introduced, and 
in the above formula the noise is idealized as an independent additive Gaussian 
random signal, characterized in terms of a system temperature Tsys, to be added to 
Tsky to obtain the variance actually measured. The additive noise lumped together 
and known as the “system” temperature includes thermal emission from the atmo- 
sphere, Johnson noise from dielectric losses in the feedlines, noise in the detector 
itself — in other words, everything other than the sky temperature that one would 
measure from space with an ideal measuring device. 

The above formula was derived treating the incoming electromagnetic field 
as entirely classical, and for a thermal source this treatment is valid in the R- 
J part of the spectrum — that is, when Tyource > hv /kp. For the CMB, with 
Temp = 2.725K, this requires that v < vomp = kpTomp/h = 57GHz, meaning 
that quantum effects introduce significant corrections to the above formula, but one 
is still far from the regime where photons arrive in a nearly uncorrelated way, obey- 
ing Poissonian counting statistics, which is what happens for observations in the 
extreme Wien tail of the blackbody distribution. Before introducing the quantum 
corrections to Eq. (83), let us finish our discussion of this result based on classical 
theory extracting all the lessons to be learnt from this result. In the R—J regime, 
where N > 1, N being the photon occupation number, roughly speaking, photons 
arrive in bunches of N photons, so the discreteness of the photons is not an issue. 
As long as a fraction greater than about 1/N of the photons is captured, little addi- 
tional noise is introduced. The intrinsic noise of the incoming electromagnetic field 
is almost entirely classical in origin and can be modeled faithfully using stochastic 


TI-104 M. Bucher 


classical electromagnetic formalism. (For a more detailed discussion of these issues, 
see Ref. 129.) From the above formula, we learn that noise can be reduced by the 
following measures: (i) choosing as wide a bandwidth as possible, (ii) increasing the 
observation time, and (iii) increasing the number of detectors. We also learn that 
although lowering the internal noise of the detection system increases the accuracy 
of the measurements, once the system temperature is approximately equal to the 
brightness temperature of the source, lowering the system temperature further leads 
only to marginal improvement in sensitivity. 

We now derive the quantum corrected version of the radio astronomer’s equa- 
tion, using a Planck distribution instead of the classical description based on a 
deterministic stochastic Gaussian field. For the Planck distribution, where py = 
x /(1—2) with x = exp(—hv/kpT), 


((6N)*) = (N)? + (N) = N74 N, (84) 


In the R-J regime, where N > 1, the first term dominates, reproducing the result 
obtained using a classical random field as described above. But as the frequency 
is increased and one starts to pass toward the Wien tail of the distribution, the 
second term increases in importance. In the extreme Wien tail, where the second 
term dominates, we observe the fluctuations characterized by Poisson statistics, 
where the (rare) arrivals of photons are completely uncorrelated. This means that 
the fractional error for the number of photons counted must be increased by a 
factor of 
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and the quantum-corrected version of Eq. (83) becomes!!3 


or (tH + Toys ) . 1 = hy ) . (86) 
I T sky 4/ (AB)tobs 2kp (Tsky + Deve) 


14.2. CMB detector technology 


Having analyzed the performance of an ideal detector limited only by the intrinsic 
fluctuations of the incoming electromagnetic field, we now review the state of the 
art of existing detector technologies, which may be divided into two broad classes: 
(i) coherent detectors, and (ii) incoherent detectors. 

Coherent detectors transfer the signal from a feed horn or antenna onto a trans- 
mission line and then coherently amplify the signal using a low-noise amplifier, 
generally using high electron mobility transistors (HEMTs) and switching with a 
cold load or between different points on the sky in order to mitigate 1/f noise.'®° 
Coherent detection is an older technology. Coherent detectors have the advantage 
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that they can operate at much higher temperatures than the competing incoher- 
ent bolometric detectors. Avoiding active cooling through a cryogenic system is 
a definite plus, especially in space, where reducing risk of failure is important. 
However, unlike their incoherent competitors, at the high frequencies of interest 
for CMB observations, coherent detectors have noise levels far above the quantum 
noise limit, and until now coherent detectors have not been able to reach frequen- 
cies above circa 100 GHz, although some believe that this situation may improve. 
It is not presently known how coherent amplification would function in the Wien 
part of the blackbody spectrum. Coherent detectors were used for the COBE DMR 
experiment, for WMAP (including the frequencies 20, 30, 40, 60, and 90 GHz), and 
the Planck LFI (low frequency instrument), which included the frequencies 30, 45, 
and 70 GHz. One advantage of coherent detectors is their insensitivity to cosmic 
rays. 

Another property of coherent detectors is that for a single mode measurement, 
all four Stokes parameters (i.e., I, Q, U, V) can be measured simultaneously. For 
incoherent detectors one can measure only two Stokes parameters simultaneously. 
However for measurements outside the R—J part of the spectrum, this advantage 
of coherent detectors disappears, because in the Wien part of the spectrum the 
photons are not bunched to any appreciable degree. On the other hand, for coherent 
detectors, it is not possible to construct multi-moded detectors, where the number 
of detectors can be greatly reduced by sacrificing angular resolution, but not at 
the cost of less sensitivity on large angular scales. Coherent detectors are always 
single-moded, sampling at the diffraction limit. 

Incoherent detectors, unlike their coherent counterparts, do not attempt to 
amplify the incident electromagnetic wave. Rather the incident electromagnetic 
wave, or equivalently the stream of photons, is directly converted into heat, chang- 
ing the temperature of a small part of the detector of small heat capacity, whose 
temperature is monitored using a thermistor of some sort. Typically the detector 
is cooled to an average temperature much lower than that of the incident radia- 
tion. The bolometers of the Planck high frequency instrument (HFT) were cooled 
to 0.1K. Consequently each CMB photon produces many phonons in the detec- 
tor, and therefore the counting statistics of the phonons do not to any appreciable 
extent limit the measurement. Heat flows from the sky onto the detector and is then 
conducted to a base plate of high heat capacity and thermal stability. A thermal 
circuitry (which can be modelled as a sort of RC circuit with as many loops as time 
constants) is chosen with a carefully optimized time response. 

Several technologies are used for the thermistor. One of the most popular nowa- 
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days is transition edge sensors (TES), 
at the edge of the superconducting transition, so that minute changes in tempera- 
ture give rise to large changes in resistivity. A feedback circuitry with heating is used 
to maintain the assembly near a fixed location on the normal-superconducting tran- 


sition edge. One disadvantage of TES detectors is their low electrical impedance, 
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which requires that SQUIDs (superconducting quantum interference devices), which 
need careful magnetic shielding, be used for the readout. 

Several new promising technologies are being developed and perfected for the 
next generation of bolometric detectors. One such technology is kinetic inductance 
detectors (also known as KIDs).4!4” For these detectors, a resonant electric circuit 
is constructed by depositing a pattern of superconducting film on a thick dielectric. 
Photons incident on the superconducting film break Cooper pairs causing its surface 
impedance to vary with the incident photon flux, which in turn alters the resonant 
frequency. The shifts in frequency of the resonator allow the KID to be read out ina 
simple way without using SQUIDs. The surface impedance of the superconducting 
film is largely determined by the inertia of the superconducting electrons, hence the 
name “kinetic inductance.” 

Another possibly promising technology is the cold electron bolometer,!! 
ing a small metal filament connected to a superconductor on both sides by means of 
a junction with a thin insulator in between. A bias current that passes through the 
assembly acts to cool the electrons in the thin wire through a Peltier-like effect to a 
temperature far below that of the substrate. CEBs are hoped to be more resistant 
to cosmic ray spikes than other detector technologies. 

Cosmic rays have proved to be more problematic than anticipated in the Planck 
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experiment, and substantial effort was needed to remove the contamination from 
cosmic ray events in the raw time stream.'8! Some of the data was vetoed and 
corrections were applied to the long-time tails of the larger magnitude events. While 
experiments from the ground benefit from substantial shielding of cosmic rays by 
the Earth’s atmosphere, how best to minimize interference from cosmic rays will be 
a major challenge for the next generation of experiments from space, which target 
sensitivities more than two orders of magnitude beyond the Planck HFI. Another 
challenge will be multiplexing, which is necessary to reduce the cooling requirements 
when the number of detectors is greatly expanded. 


14.3. Special techniques for polarization 


If it were not for the fact that the polarized CMB anisotropy (i.e., the Q and U 
Stokes parameters) is much smaller than the anisotropy in the J Stokes parame- 
ter, measuring polarization would not pose any particular problems, because most 
detectors are polarization sensitive or can easily be made so, and there would be 
no need to discuss any techniques particular to polarization measurements. An 
inspection of the relative amplitudes of the spectra in Fig. 21 highlights the prob- 
lems encountered. In particular if one wants to search for B modes at the level 
of r ~ 107% or better, the requirements for preventing leakages of various types 
become exceedingly stringent, as we now describe. 

Let us characterize the problem for the most difficult polarization observation: 
measuring the primordial B modes. We may classify the leakages to be avoided 
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Fig. 21. Summary of T,E, and B anisotropy power spectra from scalar and tensor modes. The 
green curves (from top to bottom) represent the TT, TE and EE CMB power spectra for the 
“scalar” mode, while the blue curves represent the CMB isotropies for the tensor mode. The broken 
blue curves (from top to bottom) and the top solid blue curve represent the TT, TE, EE and BB 
anisotropies assuming a value of the tensor-to-scalar ratio of r = 0.1. The lower two solid curves 
represent the predicted BB anisotropy for r = 0.01 and r = 0.001. The red curve shows the 
BB power spectrum arising nonlinearly from the scalar mode as a result of gravitational lensing. 
(Credit: M. Bucher) 


by ordering them from most to least severe: (i) leakage of the CMB temperature 
monopole to the B mode, (ii) leakage of the T anisotropies to the B mode, and (iii) 
leakage of the E mode to the B mode. 

One of the solutions to many of these problems is polarization modulation. The 
idea is to place an element such as a rotating half-wave plate, whose orientation 
we shall denote by the angle 6, between the sky and the detector, preferably as 
the first element of the optical chain, as sketched in Fig. 22. Half-wave plates can 
be constructed using anisotropic crystals such as sapphire or using a mirror with 
a layer of wires a certain distance above it so that one linear polarization reflects 
off the wires while the other reflects off the mirror, resulting in a difference in path 
length. Mathematically, we have 
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Fig. 22. Polarization modulation by means of a rotating half-wave plate. We show a transmissive 
rotating half-wave plate (HWP), here placed in front of a microwave feed horn pointing directly 
at the sky. However for modern CMB experiments, the horn would generally be replaced with a 
telescope having intermediate optical elements and horns on the focal plane. The rotating HWP is 
birefringent, with its fast polarization axis represented by the line OO’. As explained in the main 
text, by measuring only that part of the signal varying at an angular frequency 42 (where 2 is 
the angular velocity of the rotating HWP), one can prevent leakage from the much larger Stokes 
parameter J into the linearly polarized components Q@ and U. The presence of the HWP, while 
complicating the instrument design somewhat, allows many hardware requirements to be relaxed 
substantially compared to what would be needed without polarization modulation. 


so that in terms of the Stokes parameters, expressed below as the matrix-valued 
expectation value (E;E;), so that 
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(88) 


By measuring only the component that varies with an angular frequency 40, we 
can measure the polarization without making two independent measurements of 
large numbers that are then subtracted from each other. The much larger Stokes 
intensity J does not mix because it remains constant in time. 

To see how polarization modulation helps, let us suppose that the vector 
piideal we Sean? tae me ie is replaced by the vector pee with 


piactual) = ce oe ae peel) = (I rm epare) 


det d det det > Where I is the 


Physics of the cosmic microwave background anisotropy TI-109 


Fig. 23. Planck polarization sensitive bolometers. Shown above is a photograph of one of the 
Planck polarization sensitive bolometers consisting of two grids of parallel wires oriented orthog- 
onally and placed on top of each other but separated by a small distance. Each grid has its 
own thermistor, allowing the two selected components of the linear polarizations to be read out 
separately. (Credit: ESA/Planck Collaboration) 


identity matrix and € represents a hopefully small but unknown uncorrected resid- 
ual error in the actual linear response of the detector pair. 

We discuss the setup for measuring polarization used in Planck as described 
above (see Fig. 23). We describe some of the possible systematic effects and how 
polarization modulation by means of a rotating half-wave plate would remove these 
systematic effects. Ideally the two orthogonal grids of wires would correspond but 
for the direction of the linear polarization to exactly the same circular beam on 
the sky and be perfectly calibrated, with no offset between the beam centers, nor 
any time-dependent drift of the differential gain. But in practice the beams are not 
identical and the gain and offset of the electronics drift in time. The centers of the 
beams do not coincide, and without corrections this would cause a local gradient 
of the T' to be mistaken for FE or B polarization. Differential ellipticity would cause 
the second derivative of T to leak into E and B. Moreover, the actual beams are 
more complicated and require three functions [i.e., maps of [(Q), Q(Q), and U(Q)] 
for a complete description. All the above effects disappear, at least ideally, with 
the polarization modulation scheme described above. Polarization modulation can 
resolve or mitigate a lot of problems but is not a panacea. For example, polarization 
modulation cannot prevent E — B leakages. If there are any errors in the calibration 
of the angles of the direction of the linear polarization in the sky, these errors act 
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to rotate E into B and vice versa. Moreover polarization modulation has been 
difficult to realize in practice for a number of reasons such as spurious signals due 
to microphonic coupling. Both continuously rotating and discretely stepped half- 
wave plates are options for polarization modulation. 


15. CMB Statistics Revisited: Dealing with Realistic Observations 


The analysis of a CMB experiment going from the raw time stream of data taken, to 
sky maps at a given frequency, then to a clean sky map containing only the CMB, 
and finally to a likelihood function, whose input is a complete theoretical power 
spectrum, involves many steps and details. Space does not permit a complete dis- 
cussion. The following Planck papers and references therein describe this analysis 
for the Planck experiment from the Planck 2013 Results series. See in particular the 
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following papers: Overview of products and results, 
data processing,!”? High frequency instrument data processing,!®° Component sep- 
162 and HFI energetic particle effects.!*! 
for other experiments. In this section, we limit ourselves to sketching a few issues 
and to describing an idealized statistical analysis. 

One of the main steps in CMB analysis is constructing a likelihood for the data 
given a particular theoretical model. Relative likelihood lies at the heart of almost 
all statistical inference, whether one uses frequentist methods, Bayesian methods, 
or an ecumenical approach borrowing from both doctrines. Here we shall discuss 
only how to formulate the likelihood, as how to exploit the likelihood is less specific 
to the CMB and details can be found in standard treatises on modern statistics. 
(See for example Ref. 125 for a brief overview and Ref. 210 for an authoritative 
treatment emphasizing the frequentist approach. For Bayesian sampling as applied 
to the CMB, see Refs. 122 and 57 and references therein.) 

Constructing a likelihood is straightforward for an idealized survey for which the 
noise in the sky maps is Gaussian and isotropic. For the simplest likelihood function, 
the input argument is a model for the CMB sky signal, or more precisely the 
parameters defining this model. This theoretical model defines an isotropic Gaussian 
stochastic process whose parameters one is trying to infer. In order to introduce 
some of the complications that arise, we start by writing down the likelihood L 
for the data given the model, or rather the variable —21n[£], which in some ways 
resembles y? in the Gaussian approximation. We have 


5 B CTT) +N, CTT sky) iN, 
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aration, Similar descriptions can be found 


(89) 


Here the parameters Ne represent the noise of the measurement. In the approxima- 
tion of white instrument noise — that is, with no unequal time correlations — we 
have Nz = No. The factor By represents the smearing of the sky signal due to the 
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finite beam width. For the beam profile approximated as having a Gaussian shape, 
By = exp[—o70?] where the sky intensity map is convolved with the Gaussian func- 
tion K(@) = (1/V 270?) exp[—6?/(207)]." Note that we ignore constant offsets in 
the log likelihood because for most statistical analysis only differences in the log 
likelihood are relevant. The presence of the normalization of the Gaussian implies 
that a quadratic representation of the log likelihood is not exact and higher order 
corrections are needed to avoid bias, especially at low @. 

To generalize the above expression to include polarization, we define the matrix 
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In order to generalize to less idealized (i.e., more realistic) situations, it is use- 
ful to rewrite the above results in a more abstract matrix notation, in which the 
expressions can be interpreted simultaneously as expressions in real (i.e., angular) 
space and as expressions in harmonic space, so that 


—2In[L] = det(Czn + N) + t7 (Cin +.N)'t. (92) 


Here the vector t represents the observed sky map, Cz, is the covariance matrix 
of the underlying sky signal (in general depending on a number of parameters 
whose values one is trying to infer), and N represents the covariance matrix of the 
instrument (and other) noise. In the idealized case where everything is isotropic, 
this is simply a rewriting of Eq. (89), but in less idealized cases the likelihood in 
Eq. (92) is correct whereas Eqs. (89) and (91) are not applicable to the more general 
case. 

Let us now consider the effect of partial sky coverage, which may result from a 
survey that does not cover the whole sky or from masking portions of the sky such 
as the galaxy and bright point sources where contamination of the primordial signal 
cannot be corrected for in a reliable way. If the vectors and matrices in Eq. (92) are 
understood as representations in pixel space, at least formally there is no difficulty in 
evaluating this expression in the subspace of pixel space representing the truncated 
sky. If the dimensionality of the pixelized maps were small, there would be no 
difficulty in simply evaluating this expression by brute force. Another complication 
arises from inhomogeneous noise. Most surveys do not cover the sky uniformly 
because requirements such as avoiding the sun, the earth, and the planets as well 
as instrumental considerations lead to nonuniform sky coverage. This leads to a 


“Note that in the literature, beam widths are generally specified in terms of FWHM rather than 
using the Gaussian definition above. 
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noise matrix that has a simple representation in pixel space (under the assumption 
of white noise), but not in harmonic space. 

Unfortunately evaluating likelihoods such as in Eq. (92) exactly is not possi- 
ble for full-sky surveys at high angular resolution such as the Planck survey for 
which pixelized maps include up to about Npix = 5 x 10’ pixels. While Cy, is 
simple (i.e., sparse) in harmonic space, it is dense in pixel space, and the opposite 
holds for N (ignoring correlated noise), so no representation can be found to sim- 
plify the calculation in which all the matrices are sparse. Since even enumerating 
the matrices involves O(Npix”) operations, and inverting these matrices or taking 
their determinant requires O(N”) operations, it is immediately apparent that 
brute force will not work, and more clever, approximate techniques are required. 
Pixel based likelihoods are however used at low £, where other approximations have 
difficulty representing the likelihood, in large part because the cosmic variance is 
non-Gaussian. 

While the likelihoods above assumed an isotropic Gaussian stochastic process 
for the underlying theoretical model generating the anisotropy pattern in the sky, 
these likelihoods can be generalized to include nonlinear effects, such as non- 
Gaussianity through fyz, gravitational lensing, and certain models of weak sta- 
tistical anisotropy, to name just a few examples. Using the likelihood as a starting 
point, possibly subject to some approximations, has the virtue that one is guaran- 
teed that an optimal statistical analysis will result without any special ingenuity. 

In the above simplified discussion, we have not touched on a number of very 
important issues, of which we list just a few examples: asymmetric beams, far side- 
lobe corrections, bandpass mismatch, errors from component separation, estimating 
noise from the data, and null tests (also known as “jackknives”). The reader is 
referred to the references at the beginning of this Section for a more complete 
discussion and references to relevant papers. 


16. Galactic Synchrotron Emission 


An inspection of the single frequency temperature maps from the Planck space mis- 
sion (see Fig. 8), shown here in galactic coordinates so that the equator corresponds 
to the galactic plane, suggests that the 70 GHz frequency map is the cleanest, as 
the excess emission around the galactic plane is the narrowest at this frequency. 
But at lower frequencies, the region where the galactic contamination dominates 
over the primordial CMB temperature anisotropies widens. This contamination at 
low frequencies is primarily due to galactic synchrotron emission, resulting from 
ultrarelativistic electrons spiralling in the galactic magnetic field, whose strength 
is of order a few microgauss. (See for example Ref. 78.) This emission is gener- 
ally described as nonthermal because the energy spectrum of charged particles is 
non-Maxwellian, especially at high energies. The electrons in question reside in the 
high energy tail of the electron energy spectrum, described empirically as a falling 
power law spectrum. Models of cosmic ray acceleration, for example arising about 
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a shock from an expanding supernova remnant, are able to explain spectra hav- 
ing such a power law form.?° If the radiating charged particles were nonrelativistic, 
their radiation would be emitted at the cyclotron frequency w, = eB/mce, with very 
little emission in higher harmonics, and given the measured values of the galactic 
magnetic field, one could not thus explain the observed high frequency emission. 
However, if the charged particles are highly relativistic, due to beaming effects the 
bulk of the radiation is emitted in the very high order harmonics, allowing the 
observed emission to be explained for reasonable values of the galactic magnetic 
field.©? Although the data concerning the three-dimensional structure of the galac- 
tic magnetic field is at present in a very rudimentary state, and little is known 
observationally about the cosmic ray spectrum outside our immediate solar neigh- 
borhood, the physical mechanism underlying galactic synchrotron emission is well 
understood physics. One important point is that no matter what the details of the 
cosmic ray energy spectrum may be, the kernel of the integral transform relating 
this distribution to the frequency spectrum of the observed synchrotron emission 
applies considerable smoothing. Thus we can assume with a high degree of confi- 
dence that the synchrotron spectrum is smooth in frequency. Theory also predicts 
that this synchrotron emission will be highly polarized, at least to the extent that 
the component of the galactic magnetic field coherent over large scales is not negli- 
gible. This expectation is borne out by observations. It is customary to describe the 
synchrotron spectrum using a power law, so that its R-J brightness temperature 
may be fit by an Ansatz of the form 


% 

Tr—4(Q,V) = Tr—s(Q, V0) (=) ; (93) 
where empirically it has been established that a = 2.7-3.1. The same holds for 
the polarization — that is, the Stokes Q and U (and also V) parameters. In the 
ultrarelativistic approximation this emission is primarily linearly polarized, but 
there is also a smaller circularly polarized component suppressed by a factor of 
1/y. Variations in the spectral index depending on position in the sky have been 
observed.®" 

A description of the WMAP full-sky observations of synchrotron emission 
appears in the first-year WMAP foregrounds paper??? and the polarization obser- 
vations are described in the three-year WMAP paper on foreground polarization.??° 
A greater lever arm, in particular for studying the synchrotron spectral index, can 
be obtained by including the 408 MHz Haslam map covering almost the entire sky.®° 


17. Free—Free Emission 


Another source of low-frequency galactic contamination is free—free emission aris- 
ing from Bremsstrahlung photons emitted from electron—electron collisions in the 
interstellar medium, and to a lesser extent from electron-ion collisions. Like galac- 
tic synchrotron emission, the free—free emission brightness temperature falls with 
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increasing frequency, but the fall-off is slower than that of the galactic synchrotron 
emission. 

Unlike synchrotron emission, free—free emission is not polarized. Ha (n = 3 > 
n = 2) emission resulting from recombination of ionized atomic hydrogen can be 
used as a template for removing the free-free component, because the Ha emission 
likewise is proportional to the square of the electron density. However, because the 
Tya(T) and Dee-tree(L, v) do not have an identical temperature dependence, free— 
free removal using an Ha template has some intrinsic error. [y.(T) is defined as 
the constant of proportionality in the emissivity relation: €47,_ =T el Tae" where 
€Ha is a bolometric emissivity (power per unit volume) and n, is the density of free 
electrons. Likewise €frec_free(V) = V'free-free(T, ”)Ne” where the emissivity has units 
of power per unit frequency per unit volume per unit solid angle. 


18. Thermal Dust Emission 


In Sec. 16 we discussed galactic synchrotron emission, whose R—J brightness tem- 
perature rises with decreasing frequency, making it exceedingly difficult to measure 
CMB anisotropies at frequencies below + 20 GHz, especially close to the galac- 
tic plane where the galactic synchrotron emission is most intense. We also saw 
how the fact that the brightness temperature increases with decreasing frequency 
can be understood as a consequence of the synchrotron optical depth being much 
smaller than unity and increasing with decreasing frequency. Eventually when the 
frequency is low enough so that the optical depth is near one, the brightness temper- 
ature stops rising and approaches a temperature related to the cosmic ray electron 
kinetic energy. 

This section considers another component contributing to the microwave sky: 
the thermal emission from interstellar dust, whose brightness temperature increases 
in the opposite way — that is, the brightness temperature increases with increasing 
frequency. This behavior results because at microwave frequencies, the wavelength 
greatly exceeds the typical size of a dust grain. At long wavelengths, the cross- 
section for absorbing or elastically scattering electromagnetic radiation is much 
smaller than the geometric cross-section Ogeom © a? where a is the effective grain 
radius. The dust grains can be approximated as dipole radiators in this frequency 
range because higher order multipoles are irrelevant. The dependence of the cross- 
section on frequency can be qualitatively understood by modeling the electric dipole 
moment of the grain as a damped harmonic oscillator where the resonant frequency 
is much higher than the frequencies of interest. In this low frequency approximation 


dig ce (Sy a? (94) 


for absorption, and for the elastic, or Rayleigh, scattering component, which 
becomes subdominant at very low frequencies, 


4 
Oelastic ™ (5) a”. (95) 
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This is a simplistic approximation that we later shall see is not accurate, but it 
does provide intuition about the qualitative behavior expected. 

Our knowledge of interstellar dust derives from combining different observations 
spanning a broad range of frequencies, from the radio to the UV and even X-ray 
bands, in order to put together a consistent theoretical model able to account 
simultaneously for all the observations. (For a nice recent overview, see for example 
Refs. 48 and 49.) In this section, we restrict ourselves to discussing the thermal 
microwave emission properties of the dust. 

An empirical law commonly used to model thermal dust emission at low fre- 
quencies is a Planck blackbody spectrum modulated by a power law emissivity, or 
graybody factor, 


B 
Vv By, Taust) 
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where the Planck spectrum is given by 
2hv 1 
B(y;T) = > (97) 


The physical basis for this modification to the Planck spectrum can be partially 
motivated by the following argument based on the linear electric susceptibility of 
the dust grain, following a line of reasoning first suggested by EM Purcell in 1969.1°8 
The simple argument, subject to a few caveats, suggests that 6 = 2. However the 
observational data do not bear out this prediction and instead are better fit by a 
power law with @ = 1.4 — 1.6.144158 Let y(w) represent the linear susceptibility of 
a dust grain, which as a consequence of the Kramers—Kronig dispersion relations 
must include both nonvanishing real and imaginary parts. Here d(w) = y(w)E(w), 
where d(w) and E(w) are the grain electric dipole moment and the surrounding 
electric field, respectively. As a consequence of causality, y(w) is analytic on the 
lower half-plane. Its poles in the upper half-plane represent those decaying mode 
excitations of the grain that are coupled to the electromagnetic field. Moreover, on 
the real axis x(—w) = [y(+w)]*. If we assume analyticity in a neighborhood of the 
origin, we may expand as a power series about the origin, so that 


x(w) = wo + ixiw + XQW* +++, (98) 


and xo, X1; X2;--- are all real. It follows that the energy dissipated is characterized 
by the absorptive cross-section 


Oabs (w) = —— (99) 


and at low frequencies 
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which corresponds to an emissivity index 3 = 2 as w — 0. The perhaps questionable 
assumption here is that there exists a circle of nonzero radius about the origin of 
the w-plane where y(w) has no singularities. If we postulate that there is an infinite 
number of poles of suitably decreasing strength whose accumulation point is w = 0, 
we may evade the conclusion that asymptotically as w — 0, @ — 2. For an insightful 
discussion of how such low-frequency poles could arise, see C. Meny et al.!°° 


19. Dust Polarization and Grain Alignment 


The fact that interstellar dust grains are aligned was first seen through the polar- 
ization of starlight at optical frequencies in 1949.78 Since this discovery an 
abundant literature has been amassed elucidating numerous aspects of dust grain 
dynamics. The best bet solution to the alignment problem has evolved over the 
years as additional relevant physical effects were pointed out. It is unclear whether 
the current understanding will remain the last word on this subject. The most naive 
solution to the alignment problem would be some sort of compass needle type mech- 
anism with some dissipation. However, it was discovered early on that in view of 
the plausible magnitudes for the galactic magnetic field, such an explanation cannot 
work.” The theoretical explanation that has evolved involves detailed modeling of 
the rotational dynamics of the grains, which as we shall see below almost certainly 
rotate suprathermally. 

For microwave observations of the primordial polarization of the CMB at ever 
greater sensitivity, the removal of contamination from polarized thermal dust emis- 
sion is of the utmost importance. This fact is highlighted by the discussion of how 
to interpret the recent BICEP2 claim (discussed in Sec. 11.1) of a detection of 
a primordial B mode signal based on essentially a single frequency channel. This 
interpretation however seems unlikely given subsequent Planck estimates of the 
likely dust contribution.1°° The problem is that little data is presently available on 
polarized dust emission at microwave frequencies. Current models of polarized dust 
involve a large degree of extrapolation and are in large part guided by simplicity. 
From a theoretical point of view, polarized dust could be much more complicated 
than the simplest empirical modeling suggests. For example there is no a priori 
reason to believe that polarized emission from dust must follow the same frequency 
spectrum as its emission in intensity. It could well be that the weighting over grain 
types gives a different frequency dependence, or that as more precise dust modeling 
is called for, the need to include several species of dust with spatially nonuniform 
proportions will become apparent. Moreover, based on theoretical considerations, 
it is not clear that there should be a simple proportionality between polarization 
and the strength of the magnetic field. If the BICEP2 B mode signal is confirmed 
as a dust artefact, we could see ourselves in the coming years trying to detect val- 
ues of the tensor-to-scalar ratio around r ¥ 1073, which would require exceedingly 
accurate polarized dust removal from the individual frequency maps, and before 
this happens a wealth of new observational data on polarized dust emission will 
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become available. If, on the other hand, a detection of B modes at a large value 
of r is made, as one tries to map out the precise primordial B mode spectrum, for 
example to measure np and thus check the consistency between Ps(k) and Pr(k), 
the need for accurate dust removal will likewise be great, but for slightly different 
reasons. 

Since the observational inputs are likely to change, here we particularly empha- 
size the underlying physics of the alignment, which may be updated, but is likely 
to change less. The theoretical issues of dust alignment have so far not been widely 
discussed among those not specialized in the ISM (interstellar medium). Neverthe- 
less, we believe that the questions raised will become increasingly relevant for CMB 
observations in the future, and this is why a thorough discussion is presented here. 

We shall find that the alignment of interstellar dust grains is closely linked to 
the rotational dynamics of a dust grain. We therefore start with a study of the 
rotation of a dust grain. Mathematically, the simplest case to analyze is a prolate 
(needle-like) or oblate (flattened spherical) dust grain with an axis of azimuthal 
symmetry, which constitutes a symmetric top. In this case, the motions can be 
solved in terms of elementary functions. For the more general case of an asymmetric 
grain (where none of the three principal values of the moment of inertia tensor are 
equal) integration of the equations of motion in the absence of torques is more 
difficult, but can be achieved using Jacobian elliptic functions.??° 

In the noninertial frame rotating with the dust grain, which is convenient 
because in this frame the moment of inertia tensor is constant in time, two 
types of inertial forces arise: (i) a centrifugal force, described by the potential 
Veer (x)/m = —0?(t)x? + (Q(t) -x)?, and (ii) a Coriolis force Feor/m = —2Q(t) x v, 
which for our purposes is unimportant because the velocities are negligible in the 
co-rotating frame. In order to understand the alignment process, we must model 
the solid as being elastic with dissipation included. 


19.1. Why do dust grains spin? 


A flippant answer to the above question might be: Why not? If we can associate a 
temperature with the three kinetic degrees of freedom of the dust grain, we would 
conclude that 


1 3 
al ~ 9 kpT rot: (101) 


Early discussions of rotating dust grains supposed that the rotational degrees of 
freedom are subject to random torques, so that 


(Tr) =0. (102) 


Later it was realized that suprathermal rotation is plausible, where the implied 
temperature of the rotation can greatly exceed the effective temperature of any of 
the other relevant degrees of freedom. 

The question now arises what temperature one should use for T,,,. A natural 
candidate might be the kinetic temperature of the ambient gas Tin. Indeed one 
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expects molecules of the ambient gas to collide with the dust grains, and such 
collisions do provide random torques, causing the angular momentum to undergo 
a random walk accompanied by a dissipation mechanism, as must be present to 
prevent the angular velocity from diverging with time. 


19.2. About which axis do dust grains spin? 


This question has two parts. First, we may ask what is the alignment of the co- 
rotating internal (body) coordinates with respect to the angular momentum vector. 
Second, we may ask how the angular momentum aligns itself with respect to the 
inertial coordinates of the ambient space. In this section, we consider only the first 
question, postponing the second question to a later section. 

But for exceptional situations where two or more eigenvalues of the moment 
of inertia tensor coincide, the principal axes of moment of inertia tensor provide a 
natural set of axes for the dust grain, and we may order the eigenvalues so that I, < 
In < Iz. We argue that the dust grain tends to align itself so that its Jz axis is parallel 
to the angular momentum. This orientation of the grain minimizes the rotational 
energy subject to the constraint that the total angular momentum remains constant. 
If the grain starts in a state of random orientation, the grain wobbles. While L 
remains constant, 2(¢) = I~1(t)L (expressed in inertial coordinates) does not, but 
rather fluctuates, exciting internal vibrational modes that dissipate. As the excess 
rotational energy is dissipated, an alignment as described above is achieved.” The 
relevant question is how this time scale compares to other time scales of the grain 
rotation dynamics. It turns out that this time scale, which we denote t;__1, is 
short. In the equations that follow, we exploit the shortness of T;_, compared to 
the other relevant time scales. This hierarchy of time scales allows us to simplify the 
equations by excluding the degrees of freedom that describe the lack of alignment of 
the principal axis of maximum moment of inertia from L, leading to a considerable 
simplification of the formalism. 


19.3. A stochastic differential equation for L(t) 


Mathematically, we may describe the combined effects of the torque from random 
collisions and the associated damping through the coherent torque using a stochastic 
differential equation of the form 


Ete) =a (Le) - LEE ) + Pan (103) 


YMuch to the dismay of NASA engineers, this is exactly what happened to the Explorer 1 rocket, 
the first US satellite launched in 1958 following Sputnik, due primarily to dissipation from its 
whisker antennas. The elongated streamlined cylinder was initially set in rotation about its long 
axis, but subsequently, after an initial wobble phase, ended up spinning about an axis at right 
angles in the body coordinates. Since then rotational stability, also relevant in ballistics, has 
become a carefully considered design issue for spacecraft. 
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In this equation, the torque described by the first term is coherent with respect to 
the body coordinates of the dust grain. Later we shall also consider torques that are 
coherent with respect to the ambient inertial coordinates, which must be modeled 
in a different way. Here the stochastic Gaussian forcing term has the expectation 
values 


(Pran,i(t)) =0, 
(Dyani(t)D ran, (t’)) = [er oijO(t = ape 


where the angular momentum diffusion coefficient has units [ur] = 
(Torque)? /(Time). 

Let us analyze the stationary ensemble defined by the above equation — in 
other words, the probability function p(L) toward which the biased random walk 
defined by the above equation evolves after a long period of time. In the low- 
temperature limit, |L| ~ LZ with negligible fractional fluctuations, and the direction 
of the angular momentum vector fy, = L(¢t)/|L(¢)| follows a random walk on the 
surface of the sphere with diffusion coefficient zz = up/L?. In the opposite high 
temperature limit, the L term is a sideshow or small correction, and the main 
effect is the competition between the first and second terms leading to a thermal 
ensemble, which for an isotropic moment of inertial tensor would take the form 


(104) 


3/2 
pL) = (5) exp|-a0"), (105) 


where 3 = a/jiy. Without other sources of dissipation or random torques, we expect 
that 6 = 1/(2kpTiintmax) where Imax is the largest eigenvalue of the moment 
of inertia tensor of the particular grain being modelled and Tin is the kinetic 
temperature of the ambient gas. It is straightforward to generalize Eq. (105) to 
the intermediate case. It was however later noted that a number of mechanisms 
exist for which ([) 4 0, which can give rise to what is known as “suprathermal 
7189 __ that is, rotation at an angular velocity larger than any temperature 
associated with the other degrees of freedom interacting with the dust grain. 


rotation 


19.4. Suprathermal rotation 


To see how such suprathermal rotation might come about, consider the caricature 
of a dust grain as sketched in Fig. 24. Let us assume that the dust grain emits 
radiation isotropically in the infrared, but that in the visible and UV bands radia- 
tion is absorbed or reflected by the surfaces indicated in the figure. Let us further 
assume an isotropic visible or UV illumination. The blackened (perfectly absorbing) 
faces suffer a radiation pressure equal to half that of the mirrored surfaces, giving a 
coherent torque in the direction indicated in the figure. The presence of two distinct 
temperatures, reminiscent of a thermal engine, is crucial to avoid thermal equilib- 
rium for the rotational degree of freedom. As in the thermodynamics of an engine, 
there is a heat flow from a hotter temperature to a colder temperature with some 
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Fig. 24. Coherent torques from grain radiation pressure. 


of the energy being siphoned off in the form of work exerted on the three rota- 
tional degrees of freedom of the dust grain. Other possibilities for coherent torques 
result from photoelectron emission at privileged sites and the catalysis of molecular 
hydrogen on favored sites on the grain surface. In the former case, the incident 
UV photon is almost pure energy due to its masslessness, but the emitted elec- 
tron carries substantial momentum for which there is a recoil. Molecular hydrogen 
forms almost exclusively through catalysis on a grain surface. Unlike in ordinary 
chemistry where truly two-body collisions are an exception and activation barriers 
prevent reactions from establishing equilibrium, for molecular hydrogen formation 
in the cold interstellar medium, the bottleneck is the lack of three-body collisions 
and the lack of an efficient mechanism for carrying away the excess energy, so that 
the hydrogen atoms combine rather than simply bouncing off each other. In princi- 
ple, the energy can be lost radiatively, but the rate for this is too slow. Given that 
4.2eV is liberated, the recoil at selected sites can produce a substantial coherent 
torque on the grain. 


19.5. Dust grain dynamics and the galactic magnetic field 


Much of the dust grain rotational dynamics does not single out any preferred direc- 
tion. To explain the polarization of starlight and of the thermal dust emission at 
microwave frequencies, some mechanism is needed that causes the dust grains to 
align themselves locally at least along a common direction such as that of the 
magnetic field. 
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19.5.1. Origin of a magnetic moment along L 


Two effects generate a magnetic moment along the axis of L: 


(i) Inhomogeneous electric charge distribution in the dust grain. The 
second moment of the electric charge density about the principal axis of the largest 
eigenvalue of the moment of inertia tensor (denoted z) induces a magnetic dipole 
moment as the grain rotates about this axis, according to 


p= 20, / d@rpe(r)(a? +”), (106) 


where p-(r) is the electrostatic charge density of the dust grain. In general, and 
particularly when the grain structure is amorphous, the net charge density is non- 
vanishing, so the integral above does not vanish. This effect is known as the Rowland 


effect. 


(ii) The Barnett effect. Minimizing the free energy tends to align the free spins of 
a rotating body with the direction of rotation. In nonferromagnetic substances this 
effect is usually negligible, but rotating dust grains present an extreme environment 
because of their minute size and isolation from their environment. Interstellar dust 
grains are among the fastest rotating macroscopic objects known. 

To understand the origin of the Barnett effect, consider an isolated rotating 
body, whose conserved total angular momentum may be decomposed into two parts 


Liot = Louk -F Stree. (107) 


We consider the energetics of the exchange of angular momentum between the 
bulk and spin degrees of freedom. We assume the spins to be free neglecting any 
interactions between them. In this case, the Hamiltonian for the system is 


ya Lbutk _ (Etot — 8)? 


a ae? nes 
and retaining only the term linear in the spin, we obtain 
H=-.-S, (109) 


where Q(t) = I-/(¢)L is the angular rotation velocity. The spins want to align 
themselves with the rotation to lower the rotational energy.!!° So far we have not 
included any terms opposing complete alignment, so at J’ = 0 complete alignment 
would result with S = SayailableQ/|Q|. But at finite temperature entropic consider- 
ations oppose such alignment, and it is the free energy F = H — T'S(S) or 


F(S;Q) = -Q-S +aTSs? (110) 


that is minimized. Far from saturation, when the degree of alignment is small, this 
quadratic term provides a good approximation to the entropy of the spin system. 
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19.6. Magnetic precession 


Having established that a spinning dust grain has a magnetic moment parallel to 
its angular momentum vector, which is aligned with the principal axis of the largest 
eigenvalue of the moment of inertia tensor, we now consider the effect of the torque 
resulting from the interaction with the ambient magnetic field. The interaction 
Hamiltonian is 


H=-—w-B, (111) 
from which it follows that 
L=px B= mL xB. (112) 


In other words, the angular momentum vector precesses at a frequency Wprec = 
‘Ym|B| around the magnetic field. Under this precession, the angle between L and 
p remains constant, meaning that this effect neither aligns nor disaligns the grain 
with respect to the local magnetic field direction. 

We estimate the time scale of this precession. Let S denote the spin polarization 
density (angular momentum per unit volume), which under saturation (i.e., all 
free spins aligned) would have a magnitude Szat, whose order of magnitude can 
be estimated by assuming approximately Avogadro’s number N, of spins having a 
magnitude h in a cubic centimeter volume, giving Ssat © O(1)(4x 1074 )erg-s-cm7?. 
We obtain the following free energy density (per unit volume): 


1 s 
Fo(S:T) = 5(keT)Nopin (=) (113) 


Here for a paramagnetic substance the order of magnitude for Nissi might be around 
1074 cm~? but for a ferromagnetic substance could be much smaller, corresponding 
to the density of domains. Minimizing the free energy with respect to S gives the 
spin polarization density 


—S sat = 
S= Gee IgQ. (114) 


We express the constant of proportionality as if it were a moment of inertia because 
it has the same units. 
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19.6.1. Barnett dissipation 


The coupling in the Barnett effect between the bulk and spin angular momenta 
does not act instantaneously. Rather as the instantaneous angular velocity 2(t) 
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changes, there is a lag in the response of S(t). In his landmark 1979 paper,'®® 
Purcell pointed out that this lag would help align the body coordinates so that the 
axis of maximum moment of inertia is parallel to the angular momentum vector. 
Purcell coined the name Barnett dissipation for this effect, whose magnitude is 
greater than the mechanical dissipation mentioned above. 

Two types of free spins participate in the Barnett effect: the electronic spins 
and the nuclear spins. The electronic spins are much more relevant for establishing 
a magnetic moment along L as discussed above in Sec. 19.5.1 because electronic 
magnetic moments are larger than nuclear magnetic moments by a factor of about 
(m,/me-). But for dissipation, this strong coupling is a liability. The nuclear spins 
contribute predominantly to Barnett dissipation because of their longer lag time. 

Mathematically, the coupled equations 


S(t) = —7(S(t) — a(t), 


L(t) = +7(S(@) — aQ(Z)), 


(116) 


describe this effect. We may integrate out S(t) to obtain the following equation 
in inertial coordinates for the torque on L resulting from the exchange of angular 
momentum between the bulk and spin degrees of freedom 


L(t) = —ayA(t) + a7? ia dt exp[—yr|Q(t — 7). (117) 


To calculate how fast this exchange of angular momentum aligns the axis of 
maximum moment of inertia with the angular momentum vector, we calculate the 
averaged rotational energy loss rate of the bulk degrees of freedom retaining only 
secular contributions. This calculation is simpler than keeping track of torques. 
The degree of alignment of Q(t) with the axis of maximum moment of inertia 
is a function of the rotational energy in the bulk rotational degrees of freedom. 
The instantaneous power transferred from the bulk rotation to the spin degrees of 
freedom is given by 


Poss(t) = 24) L(t) = -O(4) S(t) 
= =1,0) 1 {9 7 f° at'exol-wint-1)}, ars) 


and to obtain the averaged or secular contribution, we decompose Q(t) into har- 
monic components of amplitude Q, and frequency w,, so that 


(a0) . {20 -af dt! expl-rejene— 7 DI arnt (119) 


It follows that 
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We see that the stationary component (with w_ = 0) does not contribute to the 
loss but the precessing components dissipate. 

We obtain an order of magnitude for the alignment rate by assuming that 
(6I)/I = O(1) and that except for the stationary component wa © Wmax = L/Imax; 
so that the excess energy is approximately 


excess x OD gates” sin? 6 (121) 


where 6@ is the disalignment angle, and the rate of dissipation is approximately 


2 
Pross © O(1)yIgw? = sin? 6, (122) 


max 
Op > Wace 
so that 
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Yalign ~ TE eee (123) 


The action of the imperfect coupling of the spins to the bulk angular momentum 
and bulk rotational velocity is somewhat analogous to that of a harmonic balancer 
on a rotating shaft (such as a crankshaft), a common device in mechanical engi- 
neering. Here however the weakly and dissipatively coupled flywheel is replaced by 
the internal spin degrees of freedom. 


19.7. Davis—Greenstein magnetic dissipation 


If a grain rotates with an angular velocity along the direction of the magnetic field, 
in the frame co-rotating with the grain, the magnetic field is time independent. 
However the component of the angular velocity in the plane normal to the magnetic 
field Q, gives rise to an oscillating magnetic field of angular frequency 2. In 
general, the magnetic susceptibility at nonzero frequency has a nonzero imaginary 
part X%m,i, where 


Xm(W) = Xm,r(w) + 1Xm,i(w)- (124) 


The lag in the response of the magnetization to the applied driving field gives rise 
to dissipation accompanied by a torque that tends to align (or antialign) the spin 
with the magnetic field. This effect is known as the Davis-Greenstein mechanism.*? 
The alignment tendency of the Davis—Greenstein mechanism may be described 
by the pair of equations 
. 1 : 
Li =-——lL., lL) =0, (125) 
TDG 
where the decay time Tpqg indicates the time scale of the Davis—Greenstein relax- 
ation process. In deriving this simple form, we have assumed that Tpree < Toa, 
so that we may average over many precessions. We also assume that the alignment 
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with the axis of maximum moment of inertia likewise is very fast. If these assump- 
tions do not apply, an averaged equation of the simple form above does not hold, 
and these effects need to be analyzed simultaneously. 

A key issue in evaluating the viability of the Davis—Greenstein mechanism is 
the frequency of reversals of the direction of the coherent torque, termed “cross- 
over” events in the literature. If we consider for example coherent torques from the 
formation of molecular hydrogen at a small number of catalysis sites on the grain 
surface, the orientation of this torque depends on the structure of the outermost 
surface atomic monolayer, and its structure is likely to change as new layers are 
deposited or as old layers are eroded away. Consequently, the long term coherence 
time of the “coherent” torque is uncertain. Under the assumption that the alignment 
of the body coordinates with the angular momentum is fast, it is the evolution of 
the coherent torque projected onto the axis of maximum moment of inertia that 
matters. While the grain rotates suprathermally, its direction is relatively stable 
against random perturbing torques, but when its rotation velocity is in the process 
of changing sign and of the same order as the would-be “thermal” rotation velocity, 
the grain is particularly susceptible to changing orientation as a result of random 
torques as discussed in detail in Refs. 262 and 263. 

In deciding whether tpg is too long for producing a significant degree of align- 
ment, we must consider competing effects that tend to disalign the grains, which 
modify the above equation to become a stochastic differential equation of the 
form 


: 1 1 7 L 
L = —-—— (L— ég(ég-L L — Lé;) 4 t 12 
Ta ( éz(ép - L)) ea ( ép) ae n(t) (126) 


where 6g = B/|B|, 6, = L/|L|, and n(t) is a normalized white noise source whose 
expectation value is given by 


(ni(t) nj (t’)) = 6(t — 1’). (127) 


If Tre S Trand, in the absence of a magnetic field, L undergoes a random walk on the 
surface of a sphere of radius L. If tpg < Trana the magnetic dissipation is strong 
enough to bring about almost complete alignment. However if, on the other hand, 
TDG > Trand, the magnetic dissipation leads to a negligible amount of alignment. 


19.8. Alignment along B without Davis—Greenstein dissipation 


Because of the magnetic precession discussed in Sec. 19.6, an average alignment 
with the magnetic field direction (or more precisely with a direction very close to 
the magnetic field direction) can be obtained if there is a tendency toward alignment 
with some other direction characterized by a time scale tajjgn where talisn > tprec- 


The effect of precession about an axis Q (presumably along +B) in competition 
with alignment about another direction fi is described by the ordinary differential 
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equation 
L=QxL-—7(L—-Ab). (128) 


This equation has two time scales: Q~! and y~!. We are primarily interested in fast 
precession (i.e., 2 >> 7). Equation (128) is a linear equation whose solution decom- 
poses into a stationary part and transient spiralling in to approach the stationary 
attractor. We solve 


—y7 +2 0 Li nid 
—-Q Sy 0) Lie = —yL nig 5 (129) 
0 0 ty, Ly ny, 
so that 
L = “7 Fs yQ Ta 
11 2402 2 “2 +Q2 11; 
2 
ee gg NE ee 
Tie = 5 Q2 Ln. T 72 os Q2 Ina, (130) 
Ly => Li. 


In the fast precession regime of most interest to us, L = LQ(Q - A)/Q?, with cor- 
rections suppressed by a factor (y/Q). The precession has no alignment tendency 
of its own, but if the precession period is shorter than the time scale of the alterna- 
tive alignment mechanism, the precession acts to suppress alignment in the plane 
normal to 2. 

If it turns out that radiative torques combined with the fast precession described 
above is the solution to the alignment problem, a number of interesting observa- 
tional consequences follow. Blind data analysis might suggest that one should be 
able to predict polarization from the thermal dust emission intensity combined with 
a map of the galactic magnetic field, perhaps deduced from rotation measures and 
synchrotron emission. However, if the above mechanism is correct, this procedure 
will not work, even with a three-dimensional model of the unpolarized dust emission 
and galactic magnetic field. In the above equations, provided that the galactic mag- 
netic field is strong enough, its direction serves to define the direction of the grain 
alignment but its precise magnitude has little relevance. The above mechanism can 
also accommodate a large amount of small-scale power in the polarized dust sig- 
nal, even if both the galactic magnetic field and the dust density are smooth. One 
might expect the field characterizing the radiative alignment tendency L(x)A(x) to 
include an appreciable component varying on small scales. 


19.9. Radiative torques 


Electromagnetic radiation scattered or absorbed by the dust grain in general exerts 
a torque on the dust grain, as already modeled in Sec. 19.4 for isotropic illumination, 
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which is simpler because the torque direction is constant in the co-rotating body 
coordinates. Here we generalize to anisotropic illumination. 

Let T'(v, 7%) expressed in body coordinates be the torque exerted on a dust grain 
by monochromatic electromagnetic radiation of frequency v and unit spectral radi- 
ance propagating in the ” direction. For an illumination described by the spectral 
radiance I(v, 7), the torque exerted on the dust grain in inertial coordinates is 
given by 


r= f- dv e dal (v,R~'(t)Ar)I (v, A). (131) 


The complication here arises from the need to include both inertial and co-rotating 
body coordinates, related by the transformation Xjner = R(t)Xbody- 

Assuming that the time scale for radiative torques is long compared to the 
moment of inertia alignment time scale 7;_,, we may take 2 to be aligned with L 
and define an averaged kernel I'(v, 7; f,), defined as 


20 
T(v, Ar; fy) = x | dxT (vy, RiacyMs); (132) 
where Ri,,4,) represents a rotation by x about the Wz axis. Apart from a few 
general observations, little can be said about the form and properties of the radia- 
tive torque kernel, which differ from grain to grain. At very long wavelengths — 
that is, long compared to the size of the dust grain — it is a good approximation 
to treat the grain as a dipole (as one does for Rayleigh scattering). But in the 
dipole approximation there are no torques. Therefore, (a/A) serves as an expansion 
parameter for which the torque approaches zero as this parameter tends toward 
zero. Likewise a spherical grain gives a vanishing torque at any frequency as the 
result of symmetry arguments. 

Convolving the incident radiation intensity I(v,%,) with this kernel, we obtain 
the angle-averaged torque Ttot,raa(fz). If there were no magnetic field, we could 
incorporate the torque above into the previously derived stochastic differential equa- 
tion for the evolution of L [Eq. (103)] to obtain 


Lt) | L(t) 
| T tot,rad ( 


L(t) = BL) + BLED Ti) tin (138) 


where a coherent torque in body coordinates has been included as well. The above 
equation describes a random walk L(t) in three-dimensional angular momentum 
space. Owing to the stochastic random torque noise term, the rotational degree of 
freedom loses memory of its initial state after some characteristic decay time tran, 
after which its state can be described by a probability distribution p(L) calculable 
based on the above equation. The stochastic equation above has the general form 


L = v(L) — n(t), (134) 
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for which the associated Fokker—Planck equation describing the evolution of the 
probability density is 


Op(L) 
ot 
With the magnetic field included, we add the magnetic precession term to obtain 
L(t) L(t) 
t tot,rad 


|L(t) aa) + YmagL xB + Hran(t); (136) 


where the constant Ymag relates the magnetic moment to the angular momentum 
according to Up = Ymagh. 

When tran, trad > tprec, it is possible to reduce the dimension of the stochastic 
differential equation to two dimensions, namely the angular velocity w and the angle 
8px, between the vectors L and B, obtaining 


Oar = F(Opr,w) +noe(t), w= G(OBr,w) +n.(t). (137) 


= V’p(L) + V- (p(L)v(L)). (135) 


L(t) = —BL(t) + BL 


Unfortunately, very little can be said about the form of the functions F'(@g1,w) and 
G(@p1,w), which depends on the detailed grain shapes and other properties. Draine 
and Weingartner®?:>+??? studied three possible irregular grain shapes (shown in 
Fig. 25), calculating numerically the radiative torque as a function of illumination 
direction and frequency. Then assuming a certain anisotropic illumination, they 
calculated the functions F' and G in Eq. (137) and studied the solutions to this 
equation, numerically establishing the orbits in the absence of noise. Their simula- 
tions conclude that radiative torques can provide a plausible alignment mechanism. 

The proposal that radiative torques resulting from anisotropic illumination align 
the interstellar dust grains with the magnetic field offers a promising alternative 
to the Davis—Greenstein magnetic dissipation mechanism whose characteristic time 
scale may be too long to provide the needed degree of alignment. However, its 
viability is hard to evaluate given our ignorance concerning the shapes of the dust 
grains. Without such shape information, the best case that can be put forward 
in favor of this scenario based on radiative torques and anisotropic illumination is 
to show that some candidate shapes exist that give the right orders of magnitude 
combined with the argument that the Davis—Greenstein scenario cannot account for 
the data. It is discouraging that it is not possible to formulate precise predictions 
for the radiative torque scenario. The origin of grain alignment seems to remain an 
open question. 


19.10. Small dust grains and anomalous microwave emission 
(AME) 


The thermal emission of interstellar dust grains discussed above was based on the 
assumption that its thermal state may be characterized by a mean temperature 
(around 20K) and that fluctuations about this temperature subsequent to the 
absorption of an UV photon constitute a negligible correction. If a dust grain is 
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Fig. 25. A selection of grain shapes for the study of radiative torques. In Ref. 53, Draine and 
Weingartner calculated the radiative torque on a dust grain as a function of frequency v and 
incident photon direction fi for the three grain shape illustrated above. (Credit: B. Draine and J. 
Weingartner) 


sufficiently large, this is a good approximation, but for small molecules an FUV 
(far ultraviolet) photon can raise the grain temperature momentarily by an order 
of magnitude or more, allowing emission in bands where there would be almost no 
emission for a grain at its average temperature. The bulk of the energy absorbed 
from the UV photon is then rapidly re-emitted through IR fluorescence as the 
molecule rapidly cools back down to its mean temperature. Hints of such emis- 
sion requiring a temperature significantly above the mean temperature were noted 
through the observation of aromatic infrared emission features consistent with C-— 
H bonds on the edge of an aromatic molecule — that is, around 3.3 and 11.3 
microns.°°”!!8 Later, analysis of the IRAS dust maps showed that a significant frac- 
tion of the UV photons absorbed through extinction were re-emitted in bands of 
wavelength shorter than 601, where under the assumption of a reasonable aver- 
age temperature and a modified blackbody law there should be no emission.!5° 
As a consequence, the dust models were revised to extend the size distribution 
to include small grains thought to be PAHs as a result of the infrared emission 
features described above. PAHs (polycyclic aromatic hydrocarbons) are essentially 
small monolayer planar sheets of graphite with hydrogen frills on their edges. 

A template for the small grain population can be constructed using maps of 
infrared emission at \ < 60. where the thermal emission from large grains does not 
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contribute. Since the short wavelength emission from the dust grains is proportional 
to the local FUV flux density, these templates can in principle be improved by 
dividing by the local FUV flux density to obtain a better map of the projected 
small dust grain column density, which in turn can be correlated with the AME 
(anomalous microwave emission) at low frequencies. 

It was also pointed out that such a population of very small dust grains would 
be expected to rotate extremely rapidly, giving rise to electric dipole emission at 
its rotation frequency and possibly at higher harmonics, since P. ~ pgw* and 
Ww (kp Tiot) / ma? in the absence of suprathermal rotation.°°°>! The possibility 
of magnetic dipole emission from small rotating grains has also been investigated.°? 

Observationally, emission at low frequencies, where diffuse thermal dust emission 
should be minuscule, was found correlated with high frequency dust maps.4?:106-119 
Such emission would be consistent with dipole emission from small rapidly rotating 
dust grains, as described above. As an alternative explanation for such a correlation, 
it was proposed that regions of free-free emission could be correlated with regions 
of thermal dust emission. However, this low frequency excess was found not to be 
correlated with Ha line emission, which serves as a tracer of free-free emission. One 
possibility to weaken such a correlation would be to postulate an extremely high 
temperature, so that Tree-tree(Z)/'7a is large. However, it was found that such 
a high temperature would require an implausible energy injection rate to prevent 
this phase from cooling. 

The spectrum of anomalous microwave emission, presumably from spinning 
dust, has been measured by taking spectra in targeted regions having a large con- 
tribution from AME and subtracting the expected synchrotron, free—free, ther- 
mal dust, and primordial CMB signals.8°??° Figure 26 shows the spectrum thus 
obtained. The observed fall-off at low v disfavors the hypothesis of free—free emission 
correlated with thermal dust emission as the origin of the anomalous emission, 
because in that case, the spectrum of the residual should rise as v is lowered. 

Most theoretical studies have assumed that the small grains do not become 
aligned with the galactic magnetic field, but alignment mechanisms for this grain 
population have not yet been investigated in detail. Observational limits on the 
anomalous microwave emission constrain the degree of polarization to less than a 


few per cent.!% 


20. Compact Sources 


The above processes pertain primarily to diffuse galactic emission, which contam- 
inates the primordial CMB signal most severely at low multipole number ¢. On 
smaller scales, however, compact sources become more of a concern and eventually 
become the dominant contribution at all frequencies. We use the term “compact” 
to denote both unresolved point sources and partially resolved, localized sources. 
The angular power spectrum of point sources distributed according to a Poisso- 
nian distribution would scale with multipole number as @°. If ignoring the structure 
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Fig. 26. Consistency of anomalous microwave emission with spinning dust. The multi-frequency 
observations shown here were taken for a molecular cloud believed to have strong anomalous dust 
emission. We observe that a linear combination of thermal dust and free—free emission cannot 
explain the observed shape. However, if an anomalous dust component with the spectral shape 
predicted by Draine and Lazarian for electric dipole emission from spinning dust is included (dotted 
curve), a reasonable fit may be obtained. For more details, see Ref. 220. (Credit: COSMOMAS 
Collaboration) 


of the acoustic oscillations, one approximates the primordial CMB spectrum as 
having the shape £~?, one would conclude that compact sources rapidly take over 
at high @. Because of Silk damping combined with the smoothing from the finite 
width of the last scattering surface, the fall-off of the CMB at large @ is even more 
rapid. The fact that point sources are clustered rather than distributed in Poisso- 
nian manner causes the point source power spectrum to be clustered, leading to an 
excess of power at small £ compared to an £° power law. 


20.1. Radio galaxies 


Section 16 described the physical process of synchrotron emission, emphasizing 
its contribution to the diffuse emission from our own galaxy. However the same 
process is also at work in galaxies other than our own, and in particular in the rare 
but highly luminous radio galaxies at high redshift, extreme processes fueled by a 
central black hole give rise to intense emission in many bands, including synchrotron 
emission in the radio and microwave bands. For our purposes, the most salient 
and relevant property of these sources is their inverted spectrum. For a detailed 
survey of radio sources, see Ref. 212. Since each source differs slightly in its spectral 
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properties, the strategy for dealing with these sources is to mask out the brightest 
sources and then model the residual arising from the remaining unmasked sources. 


20.2. Infrared galaxies 


In Sec. 18, we discussed at length the contamination from thermal emission from the 
interstellar dust in our galaxy. Thermal dust emission, however, is not restricted to 
our galaxy. All galaxies contribute some thermal dust emission. However, the galax- 
ies at high redshift in which star formation is occurring at a rapid rate contribute 
predominantly to the extragalactic thermal dust emission in the microwave bands 
of interest to us. In fact, most of the energy of the radiation from these galaxies 
in which intense star formation is taking place is emitted in the infrared, because 
the light and UV emitted is absorbed by dust, which becomes heated and in turn 
re-emits in the infrared. 

Some of these infrared galaxies are resolved by current observation. Others 
remain unresolved by present day observations and these are attributed to the 
diffuse infrared background. These infrared galaxies do not all emit with the same 
frequency spectrum, in part because they are spread over a wide range of redshifts. 
For the early discovery papers on the CIB (cosmic infrared background) from COBE 
see Refs. 187, 85 and 63. For more recent observations from Planck, see Ref. 155. 


21. Other Effects 
21.1. Patchy retonization 


In Sec. 7.5, we idealized the reionized hot electron gas extending from intermediate 
redshifts to today as being isotropically distributed so that the free electron density 
depends only on redshift. Such a reionization scenario is implausible but provides 
a good approximation for the effect of reionization on the CMB power spectrum at 
low £. In a more realistic reionization scenario, the universe becomes reionized in 
an inhomogeneous manner. As the first stars and quasars form, their UV radiation 
creates around them spheres of almost completely ionized hydrogen (i.e., HII) that 
grow and eventually coalesce. These spheres have sharp edges that act to trans- 
form large-angle power from the primordial CMB into small-scale power. For some 
estimates of the impact due to patchy reionization of the angular CMB spectrum, 
see Ref. 253. 


21.2. Molecular lines 


The sources of diffuse galactic emission discussed so far all have a broad continuum 
emission spectrum. But there also exist molecular lines, mostly from rotational 
transitions of molecules, that emit in the microwave bands of interest here. The 
brightest of these are the CO lines,?° of which the lowest is situated at 115.3 GHz. 
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Since CO is a linear molecule, there are also transitions at integer multiples of 
this frequency. Removing CO emission from the Planck data has proven to be a 
challenge.'®? For more sensitive future experiments, it is likely that other lines will 
also present concerns. 


21.3. Zodiacal emission 


Besides the planets and asteroids, our solar system is filled with large dust grains, 
known as zodiacal dust, which on the average are much larger than the interstel- 
lar dust grains described above. These grains manifest themselves by a variety of 
means. In the visible they scatter light from the sun, and this background must be 
subtracted when analyzing diffuse emission in the optical and near infrared bands. 
The zodiacal dust is concentrated around the ecliptic plane. It is a good but not per- 
fect approximation to model the distribution of zodiacal dust assuming azimuthal 
symmetry. The temperature of the zodiacal dust is much above that of the inter- 
stellar dust and roughly proportional to the inverse square of the distance from the 
sun. Subtracting zodiacal emission is challenging because its contribution to a given 
point on the celestial sphere depends on the time of year when the observation is 
made, because observations at different times of the year look through different 
columns of dust, even when azimuthal symmetry is assumed. The ideal observing 
program for characterizing the zodiacal dust would cover the whole sky many times 
over the year in order to accumulate as many lines of sight to the same point on the 
celestial sphere as possible and to carry out a sort of tomography. For a discussion 
of zodiacal emission at long wavelengths as measured by Planck, see Ref. 164. For 
previous work on zodiacal emission in the far infrared, see Ref. 64. 


22. Extracting the Primordial CMB Anisotropies 


Most of the contaminants described above, with the exception of gravitational lens- 
ing and the kinetic SZ effect, have a frequency dependence that differs from that 
of the primary CMB anisotropies. This property can be exploited to remove these 
nonprimordial contaminants and lies at the heart of all component separation tech- 
niques. If there are N distinct components, each of which is characterized by a 
fixed frequency spectrum, in the absence of measurement noise, a perfect separa- 
tion of these components is possible using sky maps in N frequency bands. With 
more bands, the separation is overdetermined allowing validation of the model. 
The component separation strategy sketched above, which may be described as a 
linear method, is based on a linear model, but this is not the only way to clean 
maps. More sophisticated nonlinear models may be formulated, often based on or 
inspired by Bayesian statistics that exploit other properties such as the positivity 
of certain components, spatial variation of spectral indices, and so on. Some meth- 
ods are characterized as being “blind,” meaning that the number of components is 
indicated by data itself, while other methods are more physically based. 
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Because of limitations of space, we do not go into the details or practicalities 
of component separation, instead referring the reader to a few excellent reviews 
for more details. For an early discussion of the component separation problem and 
various approaches, see Refs. 24 and 44. Reference 116 includes a comparison of 
methods used for the Planck analysis. Reference 56 describes foreground removal 
challenges for future CMB B mode polarization experiments and includes extensive 
references to the literature. 


23. Concluding Remarks 


Prior to the discovery of the CMB anisotropy in 1992 by the COBE team, cosmol- 
ogy was a field with a questionable reputation. Many from other fields described 
cosmology as a field where theorists were free to speculate with almost no obser- 
vational or experimental constraints to contradict them. Much was made of the 
supposed distinction between an “observation” and an “experiment.”” However, 
between 1992 and today cosmology has transformed into a precision science where 
it has become increasingly difficult to propose new models without contradicting 
present observation. Ironically, much science having only a tenuous connection to 
cosmology is now being explained to the public as serving to probe the big bang. 

In this review, we have tried to sketch the most important aspects of the physics 
of the CMB in a self-contained way. Other reviews with a different emphasis include 
Refs. 94, 255 and 256. We have seen how the CMB provides a snapshot of the state 
of the universe at around z = 1100, when the primordial perturbations were still 
very nearly linear, and how the precise angular pattern on the sky is also sensitive 
to a number of other cosmological parameters, which can be determined in many 
cases to the percent level or better. We have also explored tests of non-Gaussianity. 
At present no non-Gaussianity of a primordial origin has been observed, ruling out 
many nonminimal models of inflation, in particular multi-field models developed 
to explain hints of non-Gaussianity in the WMAP data, which subsequently were 
excluded by Planck. 

We also saw how better measurements of the CMB absolute spectrum could 
reveal deviations from a perfect blackbody spectrum, some of which are to be 
expected and others of which would constitute signs of novel physics, such as energy 
injection at high-redshift. The absolute spectrum remains a promising frontier of 
CMB science. 

The other promising area of CMB research is the characterization of the polar- 
ized sky, in particular to search for primordial B modes from inflation, which con- 
stitute the most unique prediction of inflation. Maps of B modes on smaller angular 
scales would also allow a characterization of the gravitational lensing field at much 


WIt is not clear whether proton—antiproton colliders would be classified as constituting an “exper- 
iment” under this criterion, because the experimenter has virtually no control over parameters of 
the hard parton—parton collisions of interest due to the broad spread in x of the parton distribution 
functions. 
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greater precision than is possible from observations of the temperature anisotropy. 
Such data would provide an important cross check of weak lensing surveys using 
the ellipticities of galaxies and provide a unique probe of absolute neutrino masses, 
perhaps allowing one to distinguish the normal from the inverted mass hierarchy. 

Since this volume is dedicated to Albert Einstein and the hundredth anniversary 
of his celebrated paper presenting the general theory of relativity, we close with a 
few remarks about Einstein, and in particular about his relation to the subject of 
this chapter. Einstein is remembered by today’s generation of physicists mostly for 
his role in the development of the special and general theories of relativity. This 
part of Einstein’s work sometimes overshadows Einstein’s numerous achievements 
in other areas and the extreme breadth of his scientific interests. Einstein is often 
viewed today as the paragon of the abstruse, highly mathematical theorist. Many 
physicists today would be surprised to learn that Einstein patented a design for an 
ammonia—butane refrigerator.°? Others might be surprised to learn that Einstein 
is the author of one experimental paper for the work after which the Einstein—de 
Haas effect is named. This effect may be thought of as the converse of the Barnett 
effect, which we saw in Sec. 19.5.1 plays a crucial role in the alignment of interstellar 
dust grains. Moreover, the stochastic differential equations in that section bear a 
close relation to Einstein’s work on Brownian motion. General relativity was once 
regarded as an abstruse, highly mathematical subfield of physics. Within astro- 
physics, relativity was first introduced through the emergence of a subfield known 
as “relativistic astrophysics.” But nowadays general relativity pervades almost all 
areas of astrophysics and has become one of the elements of the underlying fun- 
damental physics that every astrophysicist must know. One can no longer neatly 
separate “relativistic astrophysics” from the rest of astrophysics. 

This chapter told the story of the big bang cosmology from a point of view 
emphasizing the observations of the relic blackbody radiation. Einstein’s theory 
of general relativity culminating in his celebrated 1915 formulation of the grav- 
itational field equations*’” enters as a key ingredient to this story in two ways. 
First there are the unperturbed spacetime solutions now known as the FLRW solu- 
tions. This family of spatially homogeneous and isotropic solutions to the Einstein 
equations was first found by Alexandre Friedmann in 1922 and independently by 
Georges Lemaitre in 1927. At that time, the solutions were surprising because of 
their prediction of a non-eternal dynamical universe whose natural state was either 
expansion or contraction, an idea that was before its time. It was not until 1929 
that Hubble discovered that the recessional velocities of distant galaxies except for 
some noise vary approximately linearly with distance (although in its first version 


xA. Einstein, “Die Feldgleichungen der Gravitation,” Sitzungsberichte der Konigliche Preussiche 
Akademie der Wissenschaften, p. 844 (25 November, 1915). 

YFor an authoritative account from a scientific perspective of the path involving over a dozen 
papers and several reversals that finally led Einstein to the gravitational field equations as we 
know them today (i.e., Gury = Ruv — SIR = (87Gn )Tyv), see Abraham Pais, Subtle is the 
Lord (New York, Oxford University Press, 1982). 
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the data was not statistically compelling). Einstein’s general theory was deduced 
by drawing mainly on considerations of mathematical simplicity and elegance and 
on the equivalence principle, but remarkably applied to the expanding universe 
through the FLRW solution and its linearized perturbations. General relativity 
remains the basis of cosmology despite the great improvement in the data that has 
taken place between the 1920s and now. Today there is much talk about “modified 
gravity,” largely motivated by the discovery of the so-called “dark energy,” which 
no one yet really understands. It can be argued whether the dark energy belongs to 
the left-hand side of the Einstein field equation, in which case it would constitute a 
revision to Einstein’s theory, or on the right-hand side, in which case it would sim- 
ply constitute a new form of exotic stress—energy, leaving the gravitational sector 
unchanged. But so far Einstein’s theory of relativity as a classical field theory has 
survived without change. To be sure, we believe that gravitational theory as formu- 
lated by Einstein is an incomplete story. Where quantum effects become relevant, 
we know that there must be a more complete theory of which classical relativity is 
but a limiting case. But we do not yet know what that larger theory is. In the clas- 
sical regime, Einstein’s theory has survived many stringent tests, and perhaps more 
importantly has become an indispensable foundation for our current understanding 
of the universe and its origins. 
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Note added in Proof: Since the writing of this chapter, the Planck 2015 Cosmo- 
logical Results have appeared in the form of a series of papers of which a complete 
listing can be found in the overview article.2°° Apart from an improved analysis 
of the temperature anisotropies, this release contains for the first time a detailed 
analysis of the Planck TE and EE polarization results (except at the lowest multi- 
pole numbers, which will be analyzed as part of the 2016 final release). The joint 
BICEP/Keck Array—Planck analysis of the B mode signal in the BICEP2 patch cov- 
ering about a square degree of the sky resulted in a new upper limit on the tensor- 
to-scalar ratio of (r < 0.12) at 95% confidence.?® This joint analysis exploited 
the Planck high-frequency maps dominated by polarized dust emission to clean the 
dust contribution from the BICEP2/Keck Array maps at 150 GHz. This new limit 
marks the first time that a limit on r from B modes alone is competitive with limits 
obtained from the temperature anisotropy. It should be stressed that the limits 
from B mode polarization alone are much more robust against model assumptions 
than limits obtained using the temperature anisotropy for the reasons explained in 
the main text. 
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Type Ia supernovae (SNe Ia) luminosities can be corrected in order to render them useful 
as standard candles that are able to probe the expansion history of the universe. This 
technique was successfully applied to discover the present acceleration of the universe. 
As the number of SNe Ia observed at high redshift increases and analysis techniques are 
perfected, people aim to use this technique to probe the equation-of-state of the dark 
energy (EOSDE). Nevertheless, the nature of SNe Ia progenitors remains controversial 
and concerns persist about possible evolution effects that may be larger and harder to 
characterize than the more obvious statistical uncertainties. 


Keywords: SNe Ia; dark energy. 


PACS Number(s): 97.60.Bw, 98.80.—k, 95.36.+x 


1. Introduction 


“Guest stars”, the name given to supernovae by ancient Chinese astronomers, have 
been recorded for several thousands of years, but the first modern study of super- 
novae began on 31 August 1885. On that day, Hartwig discovered a “nova” in M31, 
which disappeared 18 months later. In 1919, Lundmark! noticed that the distance 
to M31 is about 0.7 million ly (later studies found it to be actually 2.5 million ly), 
which implies that the Hartwig “nova” is brighter than other novae by three orders 
of magnitude. The term “supernova” was coined shortly after. Later, it was realized 
that supernovae can be a powerful tool for measuring the distance to extragalactic 
sources. However, for want of detailed enough observed spectra, supernovae were 
not classified until 1940.2 With the advent of consistently accurate observed spectra, 
it was finally realized that there are two physically distinct classes of supernovae: 
thermonuclear supernovae, i.e. Type Ia supernovae (SNe Ia), and core collapse 
supernovae, including SN IIP, IL, IIn, IIb, Ib and Ic.* Type I supernovae are dis- 
tinguished by the absence of hydrogen lines in their optical spectra. SNe Ia are 
defined by a deep absorption trough around 6150A, which is now known to be due 
to silicon, additionally.® 
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Due to the homogeneity of their properties, SNe Ia are often used to measure 
distance. Here, we will review the methods commonly used to standardize SNe Ia 
for this purpose in Sec. 2, and the progenitor problem in Sec. 3. In Sec. 4, we discuss 
the effect of different populations of SNe Ja on their brightness. In Sec. 5, we will 
present the application of SNe Ia in cosmology, i.e. the discovery of accelerating 
cosmic expansion, and the efforts to probe the equation-of-state of the dark energy 
(EOSDE). Finally, we summarize the future promise for SNe Ia as a probe to 
precision cosmology in Sec. 6. 


2. SNe Ia as a Standardizable Distance Candle 


Generally, for two nearby objects (z < 1) with astronomy apparent magnitudes m, 
and m2, an Euclidean spatial geometry can be assumed, the ratio of their apparent 
brightness F, /E2, is related by the following equation: 


EL mg=my 

- 10073, (1) 
which indicates that if the apparent magnitude of one object is lower than that of 
another one by 5 magnitudes, the object is brighter than the other by 100 times. 
For the same object, its apparent magnitude at 10 pc is defined as the absolute 
magnitude, which may reflect its intrinsic luminosity. Assuming the absence of 
extinction by dust, its apparent magnitude, m, and absolute magnitude, M, follow 
the equation: 


gp 7 0 = pF (2) 


where Fo is the apparent brightness at 10 pc, EF the observed brightness, and d 
the distance to the object in units of parsecs. Equation (2) basically expresses the 
inverse-square law between apparent brightness and distance. Therefore, apparent 
magnitude, absolute magnitude and distance are related by the following equation: 


M=m-+5-— 5logd, (3) 


and m — M = 5logd — 5 is called distance modulus. As we shall discuss later 
in Sec. 5, at larger redshifts, for which z < 1 does not hold, the relation between 
apparent and absolute magnitudes is no longer so simple and depends on the details 
of the cosmological model assumed. If there is a method to independently obtain 
the absolute magnitude of an object, the distance may be obtained by measuring its 
apparent magnitude, like Eq. (3), and then this object may be taken as a standard 
candle to measure distance. SNe Ia are excellent candidates for standard candles 
after being corrected by some methods. 

Among all the subclasses of different supernovae,® SNe Ia are the most homoge- 
neous, with many practically identical properties, e.g. lightcurve shape, maximum 
luminosity, spectrum, and as a result, SNe Ia were taken to be perfect standard can- 
dles.* However, 1991 was a fateful year for SNe Ia studies, two peculiar SNe Ia were 
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Fig. 1. The B-band lightcurves for different SNe Ia observed. (For color version, see page II- 
CP10.) 
Source: From Ref. 9. 


found, SN 1991bg and 1991T. SN 1991bg was fainter than normal SNe Ia by about 
2 magnitudes in the V-band,°>© while SN 1991T was brighter than normal SNe Ia 
by 0.4 magnitudes.” The discovery of these two peculiar SNe Ia implies a distance 
error of about 30% if SNe Ia are assumed to be perfect standard candles. As shown 
by the collection of different supernova lightcurves in Fig. 1, different SNe Ia have 
different peak brightnesses. It became a matter of critical importance to find a way 
to reduce the distance error originating from this heterogeneity of SNe Ia.!°6 In 
1993, Phillips!? discovered that the absolute magnitude at maximum light of SNe 
Ia and the speed at which the luminosity fades in the B-band (blue light) over the 
first 15 days after the maximum light are related, as shown in Fig. 2. This relation 
implies that the brightness of SNe Ia is mainly dominated by one parameter, and 
it is widely agreed that this parameter is the amount of °°Ni produced during the 
supernova explosion that determines its maximum luminosity. Actually, one may 
arrive at another conclusion from Fig. 2, that the intrinsic magnitude dispersion of 
SNe Ia in the I-band is smaller than those in the B and V bands, i.e. the infrared 
measurement of SNe Ia may yield a more precise distance. However, several years 
later, after increasingly dim SNe Ia were included, the linear relation was found 
to be a quadratic or an exponential relation as shown in Fig. 3.1'!? Although this 
relation is widely accredited to Phillips, it was originally discovered by Rust!® and 
Pskovskii,'+:'° who noticed the correlation between the maximum light of SNe Ia 
and their light decline rate. 
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Fig. 2. The correlation between the absolute magnitudes of SNe Ia in B, V and I bands and 
Am15(B) which measures the B-band lightcurve drops during the first 15 days after the maximum 
light. 

Source: From Ref. 10. 


Following the discovery by Phillips,!° several groups developed further improved 
methods to reduce the distance error, and SNe Ia were not a standard candle 
any more, but a distance indicator. Most notably, Riess et al.'®!7 developed the 
multicolor lightcurve shapes (MLCS) method, which is known as the most math- 
ematically sound method for inferring a distance. The method accounts for the 
nonuniform error on the distance from the effects of different bands and different 
supernova colors. In this method, a “training set” of several Type Ia supernova 
(SN Ia) lightcurves was constructed, and a complete model of the lightcurves is 
dependent on three parameters, i.e. a distance modulus, a brightness offset which is 
called “luminosity correction” and an extinction value. To calibrate the brightness 
of the training set, the distance modulus and the extinction value are obtained by 
independent methods. Actually, the brightness offset reflects that an SN Ia with 
a broader lightcurve has a higher peak luminosity, i.e. the Phillips relation. The 
reason that a correction was made for color is that redder SNe Ia are generally 
less luminous, both in intrinsic terms and for dust reddening considerations.!®18 
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Fig. 3. Modified Phillips’ relation with the so-called peculiar SNe added (filled points). The solid 
line is an exponential fit for all the data. 
Source: From Ref. 12. 


Compared to the Phillips relation, although the MLCS method does not signifi- 
cantly increase the precision of distance measurements, the method may be the 
most complete and rigorous one mathematically. Figure 4 shows the typical disper- 
sions of light and color curves after correction by the MLCS method, and from the 
figure, we can see that the SNe Ia can act as very good distance indicators, because 
the dispersion at peak brightness is very small. Almost at the same time, Perlmutter 
et al.!9:?° developed another tool, named the stretch factor method. This method 
has a similar distance precision compared with the MLCS method, as shown in 
Fig. 5, where the data are the same to those in Fig. 1. Again, SNe Ia are proven to 
be potent distance indicators. It is worth noting that the MLCS method and the 
stretch factor method essentially take advantage of the same underlying phenomena 
as that underlying the Phillips relation, i.e. the slower the lightcurve evolves, the 
brighter the SN Ia. To obtain the parameters crucial to the implementation of these 
methods, various algorithms have been developed to fit the lightcurves of SNe Ia 
data, such as BATM, MLCS2k2, SALT2 and SiFTO.?! 6 
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Fig. 4. Dispersions of noise-free light and color curves after correction by MLCS method, where 
the “gray snake” present 1o-confidence region. 
Source: From Ref. 16. 


Other research groups have also tried to find other methods to obtain more 
accurate SNe Ia distance measurements. For example, in 1995, Nugent et al.2” 
noticed a linear relation between the maximum luminosity of an SN Ia and the 
ratio of the fractional depths of the bluer to that of the redder absorption troughs 
of its silicon absorption line, or the flux ratio over the Ca I, H and K absorption 
features in a spectrum near maximum light (see also Refs. 28 and 29). SNe Ia 
were therefore calibrated to be practical standard candles, and have been applied 
successfully for cosmological purposes, ultimately leading to the discovery of the 
accelerating expansion of the universe.!%?° 

The above methods generally decrease the distance error to 5—10%. However, 
for a well-observed sample, the typical dispersion on a Hubble diagram may be 
as small as 0.12 magnitudes, indicating a typical distance accuracy of 5% for a 
single SN Ia. In 2005, Wang et al.°° found that the peak brightness of SNe Ia and 
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Fig. 5. The lightcurves for different SNe Ia after correction by stretch factor method, where the 
data are the same to that in Fig. 1. (For color version, see page IH-CP10.) 
Source: From Ref. 9. 


their B—V colors at 12 days after peak B-band luminosity were strongly correlated. 
Calibrating SNe using this correlation, the typical error of the SNe Ia on a Hubble 
diagram can be reduced to less than 0.12 magnitudes. For a typical sample of SNe 
Ta with little reddening from host galaxies, the scatter may drop further to only 0.07 
magnitudes, or 3-4% in distance.°°! Here, it must be emphasized that for most 
cases, the observational error is not mainly due to imperfections in the observational 
apparatus, but rather from a lack of uninterrupted telescope time, i.e. it is difficult 
to obtain a complete lightcurve for each single object. Recently, in addition to the 
lightcurves and the colors of the SNe Ia, some spectra properties were found to be 
helpful for improving the accuracy of derived distances to SNe Ia.?8:29:32:33 

For precision cosmology, within most redshift intervals, systematic errors from 
the above correlations dominate over the statistical errors, the latter of which can 
be dealt with by increasing the size of an observational sample.34 36 The dominant 
systematic uncertainties for SNe Ia are survey-dependent. As far as what the uncer- 
tainties are and how they can be improved are concerned, one may refer to relevant 


detailed reviews.27 89 


3. Progenitors of SNe Ia 


In 1960, Hoyle and Fowler*? suggested that a degenerate white dwarf (WD) may 
ignite a runaway thermonuclear fusion in its center, and the released energy may 
disrupt the whole WD, resulting in an SN Ia explosion. Owing to a series of break- 
throughs on the subject since then, it has been indisputably proven that all SNe 
Ia arise from such explosions of carbon—oxygen white dwarfs (CO WDs) in binary 
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systems, of which the most substantial evidence is derived from the observation of 
SN 2011fe.4! For a CO WD to undergo such an explosion, it must reach or exceed 
a mass of about 1.4 Mo, henceforth referred to as the Chandrasekhar mass limit. 
The limit was named after Subrahmanyan Chandrasekhar, who was the first person 
to work out that there was a maximum mass for a WD to be supported by electron 
degeneracy pressure. He did this by combining quantum theory with relativity, and 
it was the first time to make a quantitative prediction about the maximum mass 
of WDs. In 1983, he was awarded the Nobel Prize in Physics mainly due to the 
discovery of the mass limit of WDs. 

However, the mass of a CO WD at birth cannot be more than 1.1 Mo,*2*3 
according to canonical stellar evolutionary theory, and therefore the CO WD must 
accrete additional material somehow (i.e. from its companion in a binary system), 
and gradually increase its mass to the aforementioned maximum stable mass before 
it can achieve this state and undergo an SN explosion. The details of this accretion 
process will be discussed later. During an SN Ia explosion, nearly half of the mass 
involved is depleted and synthesized into radioactive °°Ni, the radioactive energy 
of which is then injected into the supernova ejecta, heating it to the point where it 
becomes very luminous, ultimately resulting in the emissions which we observe.*+ 
The amount of °°Ni resulting from an SN explosion is the dominant determining 
factor behind the maximum luminosity of the SN Ia.4° However, despite all that 
we know, there still remain quite a few unresolved problems where SNe Ia are 
concerned. To name but a few: what are the progenitors of SNe Ia, or how does a 
CO WD increase its mass to the Chandrasekhar limit? How exactly does the WD 
explode? Do the basic properties of SNe Ia remain constant with redshift? Among 
all these basic problems, the most urgent is the one regarding the progenitors of SNe 
Ia, which can potentially affect the precision of distance measurement via SNe Ia, 
and hinder the development of precision cosmology. More specifically, the progenitor 
problem is important, for cosmology as well as for other areas of astrophysics, in 
the following ways. (i) When measuring the cosmological parameters with SNe Ia, 
one needs to know of any possible evolution of the luminosity and birth rate of SNe 
Ia as a function of redshift, which is mainly determined by the progenitor model. 
(ii) The progenitor model provides basic input parameters for explosion models of 
SNe Ia. (iii) Galactic chemical evolution models require some basic parameter input, 
such as the stellar feedback from SNe Ia, both chemical and radiative. These basic 
input parameters are closely related to the progenitor models of SNe Ia. (iv) The 
identification of the progenitor systems of SNe Ia may constrain binary evolution 
theory“? 

Over the course of the last three decades, several progenitor models were dis- 
cussed by many research groups. These models may be categorized based on the 
nature of the proposed companion of the CO WD, and we list the most notable 
ones below. (i) The single degenerate (SD) scenario®®-*!: In this scenario, the com- 
panion may be a main sequence star, a subgiant, a red giant or a helium star.5?->” 
The accreted hydrogen-rich or helium-rich material undergoes stable burning on the 
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surface of a CO WD, yields carbon and oxygen, which is then deposited onto the 
WD. When the WD mass reaches some value approaching the Chandrasekhar mass 
limit, an SN Ia is produced. The companion survives after the supernova explo- 
sion. This scenario is widely accepted and is the leading scenario. (ii) The double 
degenerate (DD) scenario: In this scenario, a binary system comprising two CO 
WDs loses its orbital angular momentum by means of gravitational radiation, and 
merges as a consequence. If the total mass of the merger exceeds the Chandrasekhar 
mass limit, an SN Ia may be expected.°® ®! In this scenario, the merger is disrupted 
completely and the companion does not survive. Note, however, it is possible that 
the merger product may experience a core collapse rather than a thermonuclear 
explosion.®*-°3 The SD and DD scenarios are the two main competing scenarios in 
the SN Ia community. Generally, to form an SD system leading to an SN Ia, the 
primordial binary system needs to experience one common envelope phase, while to 
form a DD system, the situation is more complicated and one extra common enve- 
lope phase can be necessary. For example, in Fig. 6, three subchannels lead to a WD 
+ main sequence system ending in an SN Ia explosion, and for every subchannel, 
the system experiences a single common envelope phase only (see details in Wang 
and Han**), 


Close Binaries 
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Fig. 6. Schematic for the subchannels producing WD + MS systems, which may lead to SNe Ia. 
For every subchannel, the binary system experiences a common envelope phase. (For color version, 
see page II-CP11.) 

Source: From Ref. 48. 
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In addition to the models mentioned above, the double-detonation model (sub- 
Chandrasekhar mass model) is also frequently discussed. In this model, the com- 
panion of the CO WD is a helium WD or a helium star, which fills its Roche lobe 
and initiates a stable mass transfer. If the mass-transfer rate is not high enough, 
the helium material may not burn stably, and as a consequence gradually accu- 
mulates on the CO WD. When the helium layer is massive enough, a detonation, 
which is a supersonic blast wave, is ignited at the bottom of the helium layer, in 
which the critical mass to ignite a detonation is generally dependent on the WD 
mass and the mass-transfer rate.°4 At the moment of ignition, WD is generally 
less massive than the Chandrasekhar mass limit. The resulting detonation wave 
compresses the inner CO material, leading to a second detonation near the core of 
the CO WD.® © After the supernova explosion, a low mass helium star or a low 
mass helium WD may survive to form a hypervelocity star, whose velocity is high 
enough to exceed the escape velocity of its parent galaxy.°* Remarkable progress 
has been made on this model recently, though many problems still exist.©”° If this 
model does indeed produce SNe Ia, it would contribute quite significantly to SNe Ia 
rates.’+ 

Besides the SD and DD scenarios and the double detonation model, other similar 
models have been proposed to explain the diversity generally observed among SNe 
Ta, such as the super-Chandrasekhar mass model, the single star model, the spin- 
up/spin-down model, the delayed dynamical instability model, the core-degenerate 
model, the model of a collision between two WDs, and the model of WDs near 
black holes, etc. (See Wang and Han*® for a review.) At present, it is premature to 
exclude any of these models conclusively, and no single model can explain all the 
properties of SNe Ia alone. It is possible that many of these models, or at least both 
the SD and DD scenarios, contribute to SNe Ia.48:49:72:78 


4. Effect of SN Ia Populations on Their Brightness 


When applying SNe Ia as cosmological standard candles, it is usually assumed that 
their properties and the above calibrated relations are invariant with redshift, i.e. 
the demographics of SNe Ia does not evolve with redshift, or the difference between 
SNe Ia arising from different stellar populations is negligible. This assumption is 
crucial for precision cosmology, since it may lead to systematic errors in the mea- 
surements of cosmological parameters. One way to test this assumption is to search 
for correlations between the properties of SNe Ia and those of their environments, 
since the environments of SNe Ia can represent the host population which gives rise 
to SNe Ia to a great extent. By the merit of these studies, it is now known that SN 
lightcurves and peak luminosities are correlated with host galaxy star formation 
rate, host population age, galaxy mass and metallicity.74”” 

Even before SNe Ia were applied to measure cosmological parameters, it was 
generally agreed upon that the brightest SNe Ia always occur in late-type galaxies.”® 
It should be noted, however, that both late- and early-type galaxies can host dimmer 
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SNe Ia, and subluminous SNe Ia tend to be discovered in galaxies with a significant 
old population,” which leads to a lower mean peak luminosity for SNe Ia in early- 
type galaxies than in their late-type counterparts.”*°° In addition, the mean peak 
brightness of SNe Ja in a galaxy is more homogeneous in the outer than in the 
inner regions.®!8? Also, both the birth rate and luminosity of SNe Ia trace the star 
formation rate of host galaxies.“*”"” All these observations seem to imply that age 
is one of the factors which affect the luminosity of SNe Ia, and that dimmer SNe 
Ia can arise from progenitors spanning a wide range of ages.”°:76-83 85 Considering 
that the typical galactic star formation rates increase by a factor of ~ 10 up to 
z = 1.5, one may expect that the luminosities of SNe Ia will generally increase 
with redshift, and it has been shown that the mean “stretch factor” increases as 
well.8¢-87 According to these observations, we may conclude that the average value 
of the maximum luminosities of SNe Ia, as well as the range over which they span, 
both decrease with their delay times. It can also be concluded that, should the 
maximum luminosity be determined by only one parameter, as postulated by the 
Phillips relation,!° it can be expected that the possible range of the parameter must 
decrease, and that its mean value must either increase or decrease with the age of 
SNe Ia. At present, although it has been widely accepted that the peak luminosity of 
SNe Ia is dictated by the quantity of °°Ni produced during their explosion, it is still 
controversial which parameter physically determines the °°Ni production.8®> The 
dimmer brightness for an old SN Ia could be either due to a high ignition density 


89,90 or due to a lower carbon abundance 


for a long cooling time before accretion, 
requirement for a massive WD with a less massive secondary under the frame of 
the SD model.®! %? In addition, the total mass of the DD systems was suggested to 
the origin of the brightness variation of SNe Ia,°”°4 but it seems not to fit the age 
constraints on the brightness of SNe Ia from observations.°° 
Metallicity is another factor affecting the luminosity of SNe Ia. 
oretically, high metallicity progenitors would produce subluminous SNe Ia: A high 
?2Ne abundance in high metallicity WDs generates more neutrons to feed the 
explosion nucleosynthesis, yielding more stable °*Ni, a process which consumes the 
radioactive °°Ni powering the lightcurve of SNe Ia.°° The spectra of high-z SNe Ia, 
whose metallicity is on an average lower than that of local SNe Ia, contain less inter- 
mediate mass elements, which is consistent with the idea that these SNe Ia produce 
more °°Ni to power their luminosity.®’ Metallicity may lead to an ~ 25% variation 
on the peak luminosity of SNe Ia, but observation has shown that metallicity might 
only have an ~ 10% effect on this luminosity.” Such a difference between theory 
and observation could be the result of a metallicity—age-carbon abundance degen- 
eracy, or an unknown metallicity effect. A high metallicity may lead to a relatively 
shorter delay time,?°:991 regardless of what the progenitor model is, and hence 
a slightly brighter SNe Ia. On the other hand, a star of given mass with a high 
metallicity will produce a relatively less massive WD*?*8 that has a higher carbon 
abundance,” which was suggested to produce a brighter SNe Ia.9?:9? Therefore, the 
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effect of metallicity on the peak luminosity of SNe Ia could be more complex than 
what one would imagine at first glance. 

The mass of a host galaxy may also significantly affect the properties of SNe Ia. 
For example, many observations have shown that the birth rate of SNe Ia depends 
heavily on the mass of their host galaxies.!°! 10% Compared with the age and metal- 
licity, the effect of the mass of host galaxies could be more significant. 77:84:96:104,105 
More massive galaxies tend to give rise to dimmer SNe Ia,’”"!° a trend which seems 
to be a combined result of the effects of age and metallicity on the brightness of 
SNe Ia, ie. higher mass galaxies have the tendency to retain more metals due to 
gravitational effects, and at the same time, tend to have more old stars, supergiant 
elliptical galaxies in particular. 

The above facts imply that the peak luminosities of SNe Ia evolve with redshift. 
Despite this, if the absolute magnitude of SNe Ia corrected by their lightcurve shape 
and color does not change with redshift, it is not necessarily problematic to take SNe 
Ta as standard candles for cosmology. However, people have to face an embarrassing 
situation that the age, metallicity and host galaxy mass have a systemic effect on 
the brightness of SNe Ia even after their brightness was corrected by the lightcurve 
shape and color.77:84,96,97,104,105 

For example, as shown in Fig. 7, Sullivan et al.7” found that SNe Ia originating 
in massive host galaxies and galaxies with low specific star-formation rates, of the 
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Fig. 7. Residuals from the best-fitting cosmology for a low-redshift sample as a function of host 
galaxy mass after correction for lightcurve shape and color. The red circles are the mean residuals 
in bins of host galaxy mass. The histograms on the right-hand the distribution of the residuals for 
SNe Ia in low (dashed histogram) and high (gray histogram) mass hosts, where the mass boundary 
is 10!° Mo, and the horizontal lines show the average residuals for the subsamples in low and 
high mass hosts. (For color version, see page II-CP11.) 

Source: From Ref. 77. 
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same lightcurve shape and color, are generally 0.08 magnitudes brighter than those 
found anywhere else. This result does not depend on any assumed cosmological 
model or SN lightcurve width. Such a trend may lead to a systematic error on the 
measurement of the EOSDE, if it is not corrected. Therefore, besides the lightcurve 
shape and the color, at least a third parameter to correct the brightness of SNe 
Ia, which is correlated to one or more properties of the host galaxies, must be 
incorporated to avoid systematic Hubble diagram residuals. 


5. SN Ia’s Role in Cosmology 


The previous sections have discussed the properties of SNe Ia with an emphasis on 
the corrections that can be applied to compensate for the variations in the absolute 
luminosities and also in their spectra.* In this section, we focus on how Eq. (3) 
must be modified when the condition z < 1 no longer holds. We emphasize how 
information concerning the cosmological model can be extracted from the observed 
redshift-apparent magnitude relation. 

To this end, we must first refine our definition of distance, which is complicated 
by the expansion of the universe. The “co-moving” distance doy to an object at 
redshift z is given by the integral 


dom = cHy' i dz (ay): (4) 


where H is the Hubble constant, which is a function of z, and Hp is its present 
value. For a universe filled with radiation, matter and a cosmological constant: 
Ho 
H(z’) 


= (Orel +2" 4+0md +2 4427 +O, (5) 


where Q, = (1 — QR — Qu — Qa). In the case of a “quintessence” or “dark energy” 
component of constant w = p/pc? instead of a cosmological constant (with w = —1), 
we would replace the Q, term with Qpp(1 + ghar) For 0, = 0 (a spatially flat 
universe), the “co-moving” distance as defined above coincides with the “angular- 
diameter” distance dap, but in the presence of a nonflat spatial geometry, 


Ho~*(Qx)7/? sinh[Ho(Ox)'/2dom(z)], if Qe > 0, 
dap(z) = dcm(z), if OK = 0, (6) 
ie (yn sin[Ho(—Ox)!/?dom(z)], if. <0. 


Finally, the “luminosity” distance dz is defined in terms of the “angular-diameter” 
distance according to 


dr(z) =(1+2z)dan(z), (7) 
®When the apparent luminosities of objects at differing redshifts are being compared, care must 


be taken to correct for the redshifting of the spectral bands. This is known as the so-called “K- 
correction”. 
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where the (1 + z) factor has been inserted in order to account for two effects: (i) 
As a photon propagates from redshift z to us today, its energy is redshifted, or 
diminished, by a factor of (1 + z)~1, and (ii) the rate of emission of photons is 
lowered by a factor of (1 + z)~! in terms of today’s time, rather than the time at 
the instant of emission. 

It follows that using the formula 


t=, (8) 
dy 
where LF is the absolute (bolometric) luminosity and ¢ is the apparent (bolometric) 
luminosity holds for all redshifts when the luminosity distance dz as defined above 
is used. 

The object of cosmological supernova studies is to measure the function dz (z) 
beyond its first-order linear term whose coefficient is cH) '. By measuring the 
higher-order corrections (quadratic order and beyond), we uncover the equation-of- 
state of what is driving the present expansion of the universe, and through the cubic 
and higher terms discover how this equation-of-state evolved with redshift. The SN 
Ta’s role in cosmology has been reviewed by many authors and further details on 
this technique can be found in their reviews.3%:39:106,107 
historic events in this field. 

In 1927, Lemaitre!’ deduced that the velocity of recession of distant objects 
relative to an observer can be approximated to be proportional to distance, should 
Einstein’s general relativity equation be assumed to be true. In 1929, Hubble!” 
noticed that a linear relation actually exists between an object’s recession velocity 
and its distance, judging by 22 external galaxies. This relation was henceforth 
named Hubble’s law, and hallmarks the discovery of the expansion of the universe. 
After this discovery, two questions arise naturally, i.e. what is the future of the 
universe? and what are the energy sources driving the expansion of the universe? 
To answer these questions, one needs to measure the recession velocity variation 
as a function of time or redshift, i.e. the deceleration parameter of the expansion 
qo. In modern cosmology, including Einstein’s cosmological constant,!!° the cosmic 
deceleration parameter is defined as 


go = 3 — Ox, (9) 


where Qy is the ratio between the actual mass density of the universe and the 
critical density above which the universe would collapse again someday if Q, = 
0, which is only dependent on the actual mass density of the universe, and Qa 
describes how the cosmological constant affects the expansion of the universe.!°7 
For a universe of Q, 4 0, the behavior of the universe is determined by Q), and 
Qa, together. Here, go = 0 leads to a constant expansion velocity, go < 0 leads to 
an accelerating expansion and qo > 0 leads to a decelerating expansion. It can be 
clearly seen in the equation that the expansion of the universe is accelerating if 
Q~, > (Qm/2). For a universe of QN, = 0, g = (Qm/2), and therefore, the universe is 


Here, we summarize some 
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Fig. 8. Joint confidence level for (QM, Qa) from SNe Ia corrected by MLSC method. 
Source: From Ref. 17. 


always decelerating, where q > 3, q= $ and q < 4 corresponds to the closed, flat 
and open universe, respectively (please see Fig. 8). 

Although it was known that SNe Ia are very good standard candles,’ significant 
progress on this subject had yet to be made until the 1990s, after the advent of 
CCD imagers.!!!1!? In the 1990s, two teams focused on the measurement of the 
distance to high-z supernovae, i.e. the Supernova Cosmology Project (SCP) and 
the High-z team (HZT). Both teams built up large samples, including those mea- 
sured by the Hubble Space Telescope (HST), and then individually published their 
twin studies.1%?° In the twin studies, they adjusted the parameters of an (Qm, 
Qa) universe to fit their data, as shown in Fig. 8 which is an example from Riess 
et al.,!” and obtained a similar conclusion. The surprising result is that high-z 
SNe Ia were observed to be fainter than expected from their low-z counterparts in 
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a matter-dominated universe. A matter-dominated closed universe (i.e. Quy = 1) 
was unambiguously ruled out at a confidence level of greater than 7a. Neither does 
an open, A = 0 cosmology fit the data well; the cosmological constant is nonzero 
and positive, with a confidence level of P(A > 0) = 99%. For a flat universe 
(QM + Qa=1), a positive Qa was found to be required at a confidence level of 7c. 
Actually, the probability of an accelerating expansion of the universe with (Qm > 0, 
Qa > 0, go < 0) was modest, typically 30 or less, i.e. the confidence level for a A 
dominated accelerating expansion was less than 3c. For a flat universe, the best-fit 
parameters are Qy ¥ 0.3 and Qa & 0.7, which supported an accelerated expansion 
as derived from Eq. (9). The twin studies are regarded as the original evidences for 
the discovery of the accelerating-expansion universe, and in 2011, Saul Perlmutter, 
Brian Schmidt and Adam Riess were awarded the Nobel Prize in Physics for the 
discovery. It should be noticed that the expansion of the universe is actually evolv- 
ing. At z » 0.45, the density of matter and dark energy was equal, and matter 
dominated at early epochs, i.e. the expansion of the universe was decelerating at 
e>(45" 

The discovery of accelerating expansion today implies that our universe is dom- 
inated by some unknown form of energy, which is often characterized by the ratio 
of pressure to density, ie. the dark energy equation-of-state, w = P/pc?. For some 
arbitrary length scale a, the density of the universe evolves as 


poa MH), (10) 


From this equation, it is easily seen that w = 0 for normal matter, i.e. the density 
decreases as the universe expands. If w = —1, which corresponds to Einstein’s cos- 
mological constant, its density is a constant, and does not dilute with the expansion 
of the universe. A value of w < —1 is permitted although it is debatable theoret- 
ically. In this situation, the dark energy density will increase with time, and may 
ultimately lead to the destruction of galaxies.27!!% If —1 < w < 0, the dark energy 
density will decrease with time, and it will become problematic whether the expan- 
sion of the universe will continue to accelerate if the value of w is large enough. 
If Qu is assumed to be negligible, a value of w < -3 for dark energy would be 
sufficient for the indefinite acceleration of its expansion. However, considering that 
it is very likely for Qy, to have a value of larger than 0.2,1%?° the dark energy 
has to have an even smaller value of w for this to happen. It is verified that the 
critical point for the expansion of the universe to continue accelerating is in fact 
w= —5.i4 However, it is possible that w is not a constant, but rather a variable 
which also evolves with time in the general form w(z) = wo + waz/(1+ 2), where 
w will be a constant wo if wa = 0. 

Following the discovery of dark energy, the EOSDE and the evolution of EOSDE 
with time became burning issues. Although the measurement of w is still con- 
sistent with an accelerating expansion of the universe, the uncertainty of w is 
still very large in those early observations. The value of w was generally located 
between —0.7 and —1.6 at a typical confidence level of 95%.?2!4115 Today, more 


SNe Ia as a cosmological probe II-167 
SNLS3+SDSS DR7 LRGS+WMAP7+4H, (Flat) 


—0.25) 


-0.50 


—0.75) 


Wo 


—1.00 


—1.25 


—-1.50 


0.20 0.25 0.30 0.35 wT =15 -1.0 -0.5 
Om Wo 


Fig. 9. Confidence level contours in the plane of cosmological parameters (Qu, wo, Wa). The 
horizontal dashed lines indicate a cosmological constant, i.e. wo = —1 and wa = 0. 
Source: From Ref. 120. 


and more evidences based on SNe Ia observations from SNe Ia groups and other 
observational constraints, such as those from Supernova Legacy Survey (SNLS), 
Harvard-Smithsonian Center for Astrophysics (CfA), Sloan Digital Sky Survey 
(SDSS) and SCP, support a flat universe with a constant w = —1, i.e. at least 
rapidly evolving forms of dark energy were ruled out although the current obser- 
vational constraints on wa are weak.3436116-120 For example, in Fig. 9, combined 
with full Wilkinson Microwave Anisotropy Probe 7-year (WMAP7) power spec- 
trum, SNLS3 and SDSS DR7 data, Qu = 0.271 + 0.015, wo = —0.905 + 0.196 
and wa = —0.9841}:08? were obtained, which are consistent with a flat, w = —1 
universe.!?° For most of these studies, the HST played a vital role in determining 
the cosmological parameters, and provides many high-quality, high-z observations 
which remain one of our most efficient tools for understanding the properties of 
dark energy.'!°:116121 An event at z = 1.914 was even discovered recently by HST, 
whose magnitude is consistent with that expected from ACDM cosmology.!2” 


6. Issues and Prospects 


SNe Ia are now applied to constrain the properties of the dark energy and the 
evolution of the properties with time. Although SNe Ia may provide more strict 
constraints than some other surveys, such as baryon acoustic oscillations (BAO) 
and weak gravitational lensing (WL), the combination of the results from more 
than one survey is often applied to constrain the EOSDE parameter.!!3:!7° For 
dark energy, a change of 1% in w only leads to a change of 3% in dark energy 
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density at redshift z = 2 and a change of 0.2% at z = 1—2.11!3 Therefore, to study 
the properties of the dark energy quantitatively, a precision of better than 2% to 
measure distance is required.!?? However, even for the most well-observed sample, 
the present distance accuracy seems not to be good enough. At present, a well- 
observed low redshift sample may fulfill the requirement of 2% precision, but the 
distance error increases to ~ 5% at redshifts greater than 1.1°1?7 As we know, 
SNe Ia with redshifts greater than 1 provide the best constraints on the EOSDE. 
For precision cosmology, there are still several difficulties with regard to SNe Ia. 
First, not all SNe Ia may be taken to be distance indicators, such as 2002cx-like 
supernovae.!?4 Such SNe Ia conform to no relations which regular SNe Ia do, such 
as the Phillips relation, and therefore cannot be standardized. Another subclass of 
SNe Ia that do not adhere to the Phillips relation are the so-called superluminous 
SNe Ia.!?° If these peculiar supernovae are not excluded from an SNe Ia sample, 
the confidence level of the resulting distance measurement would decrease greatly. 
Second, new lightcurve analysis methods which have been developed tend to be 
based on new intrinsic parameters, which are assumed either to be invariant with 
redshift, or at least to vary with redshift in such a way that does not introduce 
further systematic uncertainty. Despite the reduction of the luminosity uncertainty 
from 0.15 magnitudes to less than 0.10 magnitudes, or better than 5% in terms of 
distance uncertainty,**!76 which these methods serve to achieve, such assumptions 
ought to be checked carefully, since it has been shown that systematic-uncertainty- 
inducing evolution may exist ,®87 and besides the lightcurve shape and the color, a 
third parameter correlated to one or some properties of the host galaxy is necessary. 
Finally, it is crucial to increase the detection efficiency of SNe Ia at high redshifts 
for further studies, especially at redshifts greater than 1. Two factors may affect 
this detection efficiency. One is that only luminous SNe Ia are likely to be detected 
due to Malmquist bias, but luminous SNe Ia are relatively rare. The other is that 
most of the radiation may be shifted into infrared bands for the SNe Ia at high 
redshifts, which makes it difficult to detect SNe Ia by optical telescopes. Infrared 
space telescopes, in this context, may be an alternative for future SNe Ia science.*® 
For example, the recently installed Wide-Field Camera 3 on HST has granted SN 
surveys an unprecedented depth, i.e. it is possible to detect a typical SN Ia at z = 2.5 
by HST. In addition to the difficulties mentioned above, for high-z supernovae, 
the confidence level for classification by spectrum is not as high as that for low-z 
supernovae. For a high-z SNe Ia sample, although practical studies of photometric 
identification have developed methods which achieve a purity of ~ 95% for SNe 
Ia,!28180 there exists a possibility that such samples may be contaminated by a 


122,131 Therefore, techniques to remove such sample 


few Type Ib/Ic supernovae. 
impurities need to be developed. 

However, if an appropriate observational strategy is adopted, the effect of 
time dilation could provide a great opportunity for high-z SN observation. If the 
cosmological redshift is derived from the expansion of the universe, the observed 


time interval of an event will be dilated by a factor of (1 + z), which means that 
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the lightcurve of a high-z SN Ia will present a broadening peak and a more shallow 
slope for the later exponential decline phase. Therefore, the effect of time dilation 
increases the probability to detect the high-z SNe Ia, although it becomes relatively 
difficult to obtain a complete light curve. 
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Gravitational lensing is a unique and direct probe of mass in the universe. It depends only 
on the law of gravity and does not depend on the dynamical state nor the composition 
of matter. Thus it is used to study the distribution of the dark matter in the lensing 
object. Combined with the traditional observations such as optical and X-ray, it gives us 
useful information on the structure formation in the universe. The lensing observables 
depend also on the global geometry as well as on the large scale structure of the universe. 
Therefore it is possible to derive useful constraints on cosmological parameters once the 
distribution of lensing mass is accurately known. Since the first discovery of a lensing 
event by a galaxy in 1979, various kinds of lensing phenomena caused by stars, galaxies, 
clusters of galaxies and large scale structure have been observed and are used to study the 
mass distribution at various scales and cosmology. Gravitational lensing is now regarded 
as an indispensable research tool in observational cosmology. In this article we give a 
pedagogical introduction to gravitational lensing and its applications to cosmology. 


Keywords: Cosmology; gravitational lensing; strong lensing; weak lensing. 
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1. Introduction and History 


Propagation of light rays from a distant source to the observer is governed by the 
gravitational field of the intervening mass distribution. This fact was realized even 
before Einstein formulated the general theory of relativity. It seems that Cavendish 
and Michell are the first who presented the idea that the bending of a light ray 
occurs near a massive body around 1780 (see Valls-Gabaud? for the detailed his- 
tory). Then German astronomer, Soldner, calculated the bending angle of a light 
ray around a massive object in 1801 using Newtonian gravity and obtained one half 
of the correct value.? 

The correct value can be only calculated by knowing the fact that spacetime is 
curved by a massive object. General relativity provides a method to calculate the 
spatial curvature around a massive object. Einstein calculated the bending angle of 
a light ray grazing the surface of the sun correctly as 1.76 arcsec (Einstein, 1915). 
In 1919, a team led by A. Eddington succeeded to measure the change in position 
of stars around the sun during an eclipse and obtained results consistent with the 
prediction by general relativity. 
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Eddington explained the gravitational deflection of light by the analogy of lens- 
ing effect in both 1911 and 1915 predictions of the bending angle. In 1924 the Rus- 
sian scientist Chwolson pointed out the possibility of producing fake double stars 
by gravitational lensing and furthermore without detailed calculation he pointed 
out a possible existence of perfect ring image if the source is located exactly on 
the line of sight of the lens.* In 1936 a Czech amateur scientist, Mandl, visited 
Einstein at Princeton asking about the possible existence of multiple images of a 
distant star due to the gravitational force of a star. Stimulated by the discussion, 
Einstein wrote a paper on gravitational lensing by a star with detailed calculation 
and showed the existence of the so-called Einstein ring, predicted by Chwolson.*® He 
concluded however, that gravitational lensing by a star is unlikely to occur because 
of the extremely small probability of lensing. It should be noted that Link studied 
similar lensing event before Einstein. He calculated not only the position of the 
image but also magnification.®’ Zwicky estimated the probability of lensing of one 
galaxy by another galaxy as ~ 107° (Ref. 8) which is much larger than that of a 
star and tried to discover lensing phenomena in galaxy clusters. The attempt failed 
mainly because the lensed images are too faint to be detected by the apparatus 
available at that time. 

After that there were no active researches on gravitational lensing except some 
important papers. In 1960 Russian astronomers Idlis and Gridneva wrote a paper 
which is regarded as the precursor of the weak lensing idea. In 1964 Refsdal® and 
Liebes!? argued that the gravitational lensing effect may be used to measure the 
mass of a lensing object. Refsdal also pointed out that if a source’s luminosity varies 
in time, there will be a time lag of variability between the lensed images which can 
be used to estimate the Hubble parameter. 

The first lensing system QSO 0957+561A, B was discovered in early 1979 by 
Walsh, Carswell, and Weymann.'!! They made photometric observations of two 
images of a single quasar at z = 1.41. The lensing galaxy was also observed at 
redshift z = 0.355 in a cluster of galaxies. In 1997 the time lag of the variability 
was observed as 417 +3 days between images A and B.!? Since then, other systems 
with multiple images have been observed, and the list has grown to approximately 
200 by now. Time lags have been observed in more than 10 lensed systems as well. 
This list is expected to be on the order of 100 in the coming decade. 

In the late 1980s, high resolution observations by the Hubble Space Telescope 
(HST) became available and discovered a very elongated image called a giant lumi- 
nous arc (GLA) in the cluster Abell 370 in 1986.13 A multitude of arcs and distorted 
images in massive clusters have been described by Soucail et al.!4 An Einstein ring- 
like image was also found in the system MG 1131+0456 in 1988.1° 

Gravitational lensing is classified into three regimes: strong, weak and micro 
lensing. Figure 1 shows a schematic representation of strong and weak lensing phe- 
nomena. Strong lensing results in multiple images and/or highly deformed arclike 
images, called giant luminous arcs. It is used not only to measure the spatial dark 
matter distribution in the lensing object (e.g. Kneib et al.‘®), but also to constrain 
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the cosmological parameters. It has been shown that a good reconstruction of a 
lensing system may constrain the density parameter and the cosmological constant 
(Futamase and Yoshida, 1999; Yamamoto et al., 2001; Soucail, Kneib and Golse, 
2004; Meneghetti et al., 2005; Jullo et al.'”). 

An important application of strong lensing is to use it as a natural telescope. 
Strong lensing magnifies small and faint objects to make them observable, objects 
that are otherwise not observable. In particular, strong lensing by clusters has 
been used to study distant galaxies (Ellis et al.'8; Kneib et al.!9; Richard et al.?°; 
Bouwens et al.2'; Coe et al.?”). For this study, a highly accurate determination of 
mass distribution is necessary, and this becomes possible using a large number of 
multiple-image and arcs discovered by the HST profitting from progress in lens- 
modeling techniques. It is claimed that the mass in cluster cores is measured to a 
few percent accuracy (e.g. Bradac et al.???4; Jullo et al.2°; Jullo and Kneib?°). In 
the case of clusters observed by the Hubble Space Telescope Frontier Fields galaxies 
at z > 10 and as faint as 32 AB magnitude (about magnitudes fainter than the 
limit of Hubble Ultra Deep Field) are expected to be detected (Richard et al.?’). 
Future prospects for this study seem very promising. 

Another application of strong lensing is lens statistics (Turner, Ostriker and 
Gott?8). It has been shown that lens statistics are a good indicator of cosmological 
parameters, in particular, the cosmological constant (Turner?’; Fukugita, Futamase 
and Kasai?°). 

Weak lensing results only in weakly deformed images and the lensing signal 
is obtained only by averaging over background galaxies. While weak lensing is not 
spectacular compared with strong lensing, it is very important and useful. For exam- 
ple, strong lensing images are only observed in the central part of a cluster, which 
constitutes only a tiny part of the whole mass distribution of the cluster. Infor- 
mation from weak lensing is necessary to reveal the whole mass distribution of the 
cluster. The method of mass reconstruction by measuring the weak lensing effect was 
developed and applied to Abell 1689 and Cl 1409+53 by Tyson et al.?! More sophis- 
ticated methods of weak lensing mass reconstruction were developed by Kaiser and 
Squires*? and Kaiser, Squires and Broadhurst (now known as the KSB method?®). 
This method and its refinements have been applied to many clusters of galaxies. 
Weak lensing by large scale structure known as cosmic shear depends on the growth 
rate of the structure and thus on the cosmic expansion rate as well as the law of 
gravity. Thus, the accurate measurement of cosmic shear is expected to constrain 
the nature of dark energy and the deviation from general relativity. The first detec- 
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tion of cosmic shear was made in 2000 by several groups, 
most active area of research in gravitational lensing. 
Microlensing is a type of strong lensing by a stellar size object but whose deflec- 
tion angle is on the order of micro-arcsec. Thus no separate images are observed, 
but only the magnification of the image is observed. It is used to discover MAssive 
Compact Halo Objects (MACHOs)*! and planets outside of the solar system. We 


will not consider microlensing in this review. 
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For general review of gravitational lensing, we refer the reader to Schneider, 
Ehlers and Falco,*? Blandford and Narayan,** Refsdal and Surdej** and Narayan 


and Bartelmann.*° 


2. Basic Properties for Lens Equation 


The global geometry and overall evolution of the universe is well described as homo- 
geneous and isotropic by the Friedman—Robertson—Walker (FRW) metric. However, 
the matter distribution is highly inhomogeneous, up to 100 Mpc as we observe. A 
light ray propagates through these inhomogeneities feeling the local gravitational 
potential. How the averaged FRW universe appears from the local inhomogeneous 
universe within the framework of general relativity is a complicated and difficult 
problem. Fortunately, the gravitational lensing phenomena considered in this review 
are caused by weak gravitational fields and we may regard the field as locally sta- 
tionary on the time scale of the deflection of a light ray. This allows us to use a 
simple treatment of light ray propagation. We will derive the gravitational lensing 
equation in the cosmological situation starting from the geodesic equation and then 
summarize the basic concepts and properties of this equation. 


2.1. Derivation of the cosmological lens equation 


To study the propagation of a light ray, we need to specify the background geometry. 
Since we can assume that the gravitational field is weak everywhere and locally 
quasi-stationary, it is sufficient to consider the following metric: 


ds? = a?[—(1 + 2W)dn? + (1 — 2) (dy? + r?(x)dO)], (1) 


where dQ? = d6? + sin? @d¢? is the line element on the 2-dimensional unit sphere 
and p= * dt! /a(t’) is the conformal time. r(x) is the comoving angular diameter 
distance defined by 


—1/2 sin XxX 
K Ss (+5) KkK>0 
r(x) = 4% 0 (2) 


=e ee xX 
(kK) sinh (AG) Kk <i. 
The curvature parameter K is expressed by the total density parameter at present 
OQ as kK = (Qo —= 1) HF. 

Although the above metric is the first-order deviation from the FRW_ back- 
ground, it may still apply in the situation with a fully nonlinear matter distribution. 
This can be understood by noticing that the Newtonian potential is determined by 
the cosmological Poisson equation. 


V2U = 4nGa* pdm, (3) 
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where py is the background density and 6,, is the matter density contrast defined 
as Om = (Pm — Po)/Po- This equation gives 


i ie 
“a (+) iv}, (4) 


where € and Ly = Ho 1 denote the characteristic comoving scale of the density 
fluctuation and the Hubble radius, respectively. Thus one can safely assume |V| << 1 
even if dm >> 1 as long as £< Ly (Futamase*®) which is always assumed in lensing 
studies. 

The propagation of a light ray is described by a null geodesic equation. 


Om 
a 


dk! 
mre gern? = 0; (5) 


Guvktk” = 0. (6) 


One can solve the above equations perturbatively with the Oth order being the 
solution in the homogeneous and isotropic background 


k#(A) = kK #()) + 6k#(A). (7) 


By choosing the origin of the coordinate at the observer (us) and assuming a 
small deflection angle everywhere, we can set the path of the background solution 
as follows: 


K)# — (-1,1,0,0), (8) 


where we choose polar coordinate (t,r,0,¢) and carry out backward ray-tracing 
from the observer at A = 0. The integral curve is chosen as 


e)# — (—), d, 67,0), (9) 


where |@;| < 1 is interpreted as the image position angle. Since we observe the 
projection effect on the sky, we are interested in the perpendicular components to 
the line of sight of the above equation. The perpendicular components of the first 
order solution may be written as 


Pe Xr 
bk°(A) = -= dX'0°U(2'), 10 
O)=-aay f xorwe) (10) 
where a = 0,¢ and 0° = (© 9,6 4/sin? 6). The Newtonian calculation gives the 


same expression with a factor of 1 instead of 2. The difference is due to the effect 
of spatial curvature. This may be integrated to give: 


6%(\) = mop ay ow ‘eo (11) 
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Since lensing phenomena are always observed in a small patch of the sky, we 
can safely ignore the curvature of the sky (flat sky approximation) and use locally 
Cartesian coordinates 0 = (61,02) around the line of sight k‘)”. Defining the source 
plane coordinate as 3 = @(A,) where Xg is the location of the source and the image 
plane coordinate as 8 = @,, one can write the cosmological lens equation as follows: 


B=0-a(8), (12) 


where the projected bending angle is expressed by the Newtonian potential V as 
follows: 


xs (As — A) 

a= dS V8) W(X (A), (13) 
[ r(As) 

where V) is the 3-dimensional gradient operator and 0, = Og /r(A). This suggests 

that the bending angle may be expressed by the 2-dimensional gradient Vg of a 

2-dimensional potential (61, 02) 


a= Vor, (14) 


where Vo = (09, , 00,). 

In particular, if the lensing object lies within a sufficiently small region between 
At + A\(AXA « Az) such as clusters and galaxies compared with the relevant 
cosmological distances, one can assume that the light ray is deflected at A = Az. 
The projected bending angle takes the form 


g Peta 
— ak dX VO W(a(d)), (15) 
\r—-AY 

where Dg = a(As)r(As) and Drs = a(\s)r(As — Az) are the angular diameter dis- 
tances from the observer to the source, and from the lensing object to the source, 
respectively. This is called the thin lens approximation. Figure 3 shows the config- 
uration of the lensing equation in the thin lens approximation. 

Using the above expression for a, one finds 


D, Drs 87Gd 


2k =Vo-a=Agu= Ds = 


(16) 


We also defined the convergence «, which is the normalized surface mass density 


I) 
d= 1 
nO) =, (7) 
where the critical surface mass density is given by 
CDs dg 
= Ee 10 ar 18 
4nGDysDrz a (4) even ( ) 


where dx denotes the angular diameter distance Dx in units of c/Ho. 
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Fig. 1. The illustration of the gravitational lensing mechanism (see Ref. 1). 


A region of a mass distribution with © > Ug, can produce multiple images for 
some source positions and is referred to as being super-critical. One can write the 
bending angle in the following more tractable form using the surface mass density 
Xu, or the convergence k: 


1G Duis : 0-6" : 0-6! 
a= ASS feos 0'S(0') ea oe = = [ore 6':(0') = enor (19) 


2.2. Properties of lens mapping 


One may regard the lens equation as mapping from the source plane to the image 
plane. It is not a one-to-one mapping, in general. The Jacobian of the mapping 
characterizes the local distortion of the mapping: 


Opi 


Aij = 
1 00; 


= ois — A = Oi; = Wag: (20) 


This 2 by 2 symmetric matrix may be decomposed into a trace and traceless part 
as follows: 


1-K-1 —72 1 0 1 Ye 
A(@) = =(l—kK 3 
m7 ( =e = *) hy ( , (” =) en 
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Fig. 2. Giant luminous arc in galaxy cluster Abell 383. 
Source: http://www.spacetelescope.org/images/heicl106a/. 
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Fig. 3. Geometry of the gravitational lensing. 
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where y = ¥, +772 is the gravitational complex shear defined by 
1 
V1 = || cos(2) = 5 (Pat — 22), (22) 
V2 = ly sin(2¢) = dy. (23) 
In general the quantity a is called a spin-s quantity when a changes as 
a See, (24) 


under the rotation of coordinates by an angle ¢. The convergence and gravitational 
shear are then the spin-0 and spin-2 quantities, respectively. Using this property, it 
is convenient to define the tangential and radial components of the shear relative 
to some reference point 89. Noticing that the tangential component of the shear 
is obtained from 7 by rotating the angle 7/2 — ¢ (where ¢ is the polar angle of 
6 — 60), we have the following: 


714(0; 00) = —Re(ye~*"*) = —y1 cos 2¢ + 2 sin 2¢, (25) 
yx (8; 80) = —Im(ye~*) = 71 sin 2¢ — 72 cos 2¢, (26) 


where Re(z) and Im(z) denote the real and imaginary part of a complex number z, 
respectively. y, and yx are referred to as tangential and cross shears, respectively 
(Fig. 4). Under the parity transformation, these change as y% — 4, x — —Yx:- 
Tangential shear plays an important role in cluster mass reconstruction and cosmic 
shear, as shown later. 

The convergence and the shear are related in the following way: 


(0) == | o'D(0 — o")x(6") (27) 
Te 
with 
Lf OF 07) 5g 0 02-02 — 2100) ei? 
uy 5 (Fe a) isgeaas | inl6l = a/* =——pr — (28) 
Ys Y2 


Fig. 4. The relationship between 71, y2 and 74, yx.- 
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where we have introduced polar coordinates (6,¢) such that 6 = (0 cos ¢,@sin@). 
Note that there exists a global transformation that leaves the shear y(@) invariant 
such that 


K(0) > K(0) + Ko, (29) 


where Ko is an arbitrary constant. This degeneracy leads to an ambiguity in the 
mass reconstruction based only on the image distortion in weak lensing analysis. In 
practical calculations, it is more convenient to work in Fourier space because one 
can make use of the Fast Fourier transform. The kernel is expressed as follows in 
Fourier space: 

~ ke = ke + 2iky ke Did: 

D(k) = Fa = reree (30) 
where k = (k,¢,) is the vector of the polar coordinates in Fourier space. Since 
D(k)D*(k) = 77, the inverse of (26) is straightforward: 


a(k) = =D"(b)4(), (31) 


where & and ¥ are the Fourier transforms of the corresponding quantities. 

The geometrical meaning of convergence and shear is easily seen by considering 
the deformation of a circular source. Let us consider a circle with radius d3. Consider 
a small circle defined by the equation 


dé" dé = 1, (32) 


where d@ is a position vector from the origin and dG” is its transverse. Then the 
image mapped by the lensing satisfies the following equation: 


1 = dé" A’ Ado. (33) 
The matrix A has the following eigenvalues: 


Az =1-K |] =(1-a)(1 £191), (34) 


where g = y/(1—&) is called the reduced shear. The associated eigenvectors are 


TT 
cos @ won (0 ~ =) cos @ 
Vi = : = T V_= . ) (35) 
—sing gin (o = =) sin @ 
where ¢ = 4 tan7! (go /gi) is the position angle of the eigenvector V_. Using the 


orthogonal matrix O which diagonalizes the matrix A, one finds that the unit circle 
is mapped to the following ellipse (see Fig. 5) 


d0i\? 2” 
ae +} 4 =, (36) 
v- A+ 

where dO = Od@é. The dQ, is the coordinate along the eigenvector V_. 
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Fig. 5. The unit circle is mapped to an ellipse with the length of the major axis 1/|A_| and the 
minor axis 1/|A+| for kappa < 1 and its inclination from dé axis is ¢. 


Since 1/|A_| > 1/|A,+| for « < 1, this is the equation of an ellipse elongated in 
the direction of ¢ (tangential direction) with the length of the major axis 1/|A_| and 
minor axis |1/A+|. For « > 1 the ellipse is elongated in the direction ¢— 7/2 (radial 
direction) with the length of the major axis 1/|A;| and the minor axis 1/|A_]. It is 
interesting to note that the distortion disappears along the curve defined by «(0) = 
1, which always lies in the odd-parity region. Moreover in the region defined by 
& ~ land y ~0 the mapped image is highly magnified without deformation. These 
are referred to as GRAavitationally lensed yet Morphologically Regular images, or 
GRAMORs. Such an image was predicted in 1998 by Williams and Lewis*’ and 
Futamase, Hattori and Hamana* and was discovered in the galaxy cluster MACS 
J1149.5+2223 (Broadhurst et al.4%). 

Thus the convergence expresses an isotropic magnification (« > 0) or demagnifi- 
cation (K < 0). The (reduced) shear g expresses a deformation without changing the 
area. The component gi expresses the deformation along the 6)(02) axis for gi > 0 
(gi < 0). The component g2 expresses the deformation along the axis rotated 45° 
from the 6; axis. 


2.3. Caustic and critical curves 


According to Liouville’s theorem, surface brightness is conserved by gravitational 
lensing. This may be shown as follows. The energy in the interval dv radiated in the 
solid angle dQ through the area ds in the time dt may be written as IcdtdsdvdQ. (I 
is the surface brightness). This may also be written using the photon distribution 
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function f(a, p) as 
IcdtdsdvdQdv = hv f Pad? p = 4th? v* fcdtdsdvdQ. (37) 


Thus I x v°f. Liouville’s theorem tells us that f = constant and v = constant. 
Because there is no energy exchange in gravitational lensing, the surface brightness 
is conserved. Thus the magnification of the flux due to lensing is given simply by 
ratio of the solid angle between image and source. 

This is given by the inverse of the determinant of the Jacobian matrix A: 


1 1 1 
OS det ~ =e?) 00) FP 


(38) 


Images with det A > 0 (A_ > 0 or Ay < 0) have the same parity as the source, 
while images with det A < 0 (A; > 0 and »_ < 0) have parity opposite to the 
source. 

The closed curves in the image plane on which the point image is magnified 
indefinitely are called “critical curves” which is defined by the condition A+ = 0. 
The corresponding curves in the source plane are called “caustics”. The critical 


curves separate the image plane into even- and odd-parity regions. In practice, 
realistic objects like galaxies have a finite size and det A = 0 applies to every part 
of the object to be magnified indefinitely. Thus the extended source is not magnified 
indefinitely on the critical curve, but is highly magnified. Such images are called 
luminous giant arcs. If the lensing object is sufficiently massive and satisfies k > 1 
in some region, then det A(@) = 0 is satisfied somewhere in the image plane and 
a critical curve exists. When convergence is a decreasing function of the distance 
from the center of the lensing object, the curve defined by A+ = 0 is called the inner 
critical curve and the curve defined by A_ = 0 is called the outer critical curve. 
Since & > 1 on the inner critical curve, the image is elongated in the radial direction 
(radial arc), and thus the inner critical curve is sometimes called the radial critical 
curve. On the other hand « < 1 on the outer critical curve, so the image on the 
outer critical curve is elongated in the tangential direction (tangential arc) and the 
outer critical curve is called the tangential critical curve. 

The configuration of caustics determines the number of images. This is explained 
by the lens model examples below. The point determined by the condition «(@) = 1, 
namely 4(@) = ¥¢, is the critical point in the absence of shear. Although the shear 
is non-zero in general, the region where «(@) > 1 is referred to as the strong lensing 
region associated with multiple images, while the region where «(@) < 1 is referred 
to as the weak lensing region. 


2.3.1. Circular lenses 


Let us now consider simple examples of a lens model. First, we consider a lens with 
a spherically symmetric mass distribution. In this case, the bending angle at a point 
6 is determined by the projected mass (0) enclosed by a circle of radius 0 = |6| 
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around the center of mass, and is given by 


_ Drs 4GM(6), 
a(d) = DiDs 20 6=«K(0)0, (39) 
where & is the average of « inside the radius 0: 
2 1 M(@) Drs 4GM(6) 
= = 4 
Me) = 5 a(Di8)?  DeDs 282 oe) 
The lens equation becomes the scalar equation: 

B(0) = 0 —R(0)0. (41) 


This shows immediately that the image with 0 > 0 satisfies 0 > 3 and an image 
with 6 < 0 (opposite side of the source projected on the image plane) lies in the 
region with &(@) > 1. The Jacobian matrix is 


7 _ Odk\ f1 0\ 1 da ( (61)? — (62)? 2010 
A(@) = (1 -k- sa) ( ; na ( sed, fis 7 (42) 


Remembering that the mean convergence is a decreasing function of the distance 


from the center in general and y = —0/2dk&/d0, the eigenvalues are then calculated 
to be 
d dp 
= 1 f — 1 OB 0 =—, A 
Ap = 1-nt7=1-5 on) =F (43) 
B 


A~=1-K-y=1-&(6) = 


a" (44) 

As explained above, the eigenvalue A_ describes the tangential deformation of 
the image due to a geometrical effect, and A describes the radial deformation of 
the image due to the gravitational tidal effect which depends on the radial density 
profile of the lensing object. 

In the spherical lens case, the lens mapping may be understood by specifying a 
curve in the (3,0) plane (see Fig. 6, lens mapping). The curve is characterized by a 
mass distribution. Figure 6 shows the mapping given by a point mass where there 
are only two images. If the lensing mass distribution is non-singular at the origin, 
the mapping is shown in Fig. 7. As seen in this figure, the caustic corresponding to 
the outer critical curve is the origin in the source plane (actually the line in this 
case). 

When a source of a finite size approaches the caustic from the inside, two images 
with opposite parity approach perpendicularly to the inner critical line and these 
two images disappear when the source passes across the caustics. The opposite 
phenomenon occurs when the source approaches the caustic from the outside. Thus, 
the source must be inside the caustics to make multiple images. This may be more 
easily seen in Fig. 8, where the critical lines and caustics are shown in the image 
plane and source plane, respectively. This is a general property for caustics which 
are locally regarded as a straight line. Such caustics are called “fold caustics” . 


II-186 T. Futamase 


Lens mapping 


B=B, $s 
6 

Image geometry 
Image parities i | | 

Fig. 6. Lens mapping given by a point mass. 
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Fig. 7. Lens mapping given by a non-singular lens model. 
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Fig. 8. Caustics and critical curves for a spherical lens. (For color version, see page II-CP12.) 
Source: http: //gravitationallensing.pbworks.com/w/page/15553257/Strong%20Lensing. 


2.3.2. The Einstein radius and radial arcs 


From symmetry, the tangential caustics are given by 6 = 0. The image is a ring on 
the tangential critical curve, provided that the lens configuration is super-critical. 
Such a ring is known as an Einstein ring and the angular radius is called the Einstein 
radius. The Einstein radius is obtained by solving the lensing equation by taking 


B=0: 


On = 4GM(@z) Dis oe dete M(x) M2 Ding \ (45) 
= @  DrDs  |10@hA-1 Mo Dp Ds 


In actual lensing events, a source such as a galaxy is not perfectly located at 
3 = 0. In that case, we observe a pair of highly magnified, stretched images (called 
arcs) along the tangential critical curve, with a shorter arc just inside and a longer 
arc just outside of the critical curve. Thus, the position 6,;- of the tangential arcs 
is used to roughly estimate the mass of a lensing object inside the circle traced by 
the arc as follows: 


_1 (DiDs\ (92 \? 
M (Oare) = M (6g) = 3.31 x 10!*h 7 (3s) (5) Mo. (46) 


We now consider radial arcs which are sometimes observed in galaxy clusters. 
As shown above, the condition needed to have a radial arc is Ay = 0. Radial arcs 
are very useful to study the radial density profile. For example, let suppose that 
the convergence has a simple power law form as 


(0) = Ko (F). (47) 


with 0 <n < 2. The mean convergence is then given by 


(48) 
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The condition A+ = 0, i.e. d[0%]/d0 = 0 becomes 


i a) 60. (49) 


Thus the condition n < 1 is required for a lens to have a radial critical curve. 
As typical examples of a spherical lens, we consider two models. The first is the 
singular isothermal sphere (SIS) model. The 3-dimensional density is given by 


o2 


Te po} 


The convergence and the averaged convergence are given by 


p(r) 


Oy\2 Drg 1 

n(n) = 2nG (=) art (51) 
v\2 Drg 1 

(0) = 4nG (2) ar (52) 


The next example is the so-called NFW profile, which is a numerically predicted 
universal profile for CDM halos given by Navarro, Frenk and White (NFW).!”° The 
density is given by 


(53) 


where Perit; Pc and r, represent the critical density of the universe, the characteristic 
overdensity of the CDM halo, and the scale radius, respectively. The convergence 
and the averaged convergence are given by Bartelmann°? and Wright et al.>! 


m(r) = wa (5), (54) 


K(0) = 2ksg (5) @ ~ (55) 


where Kk, = Di cpuit EL and 0, = r;/Dr. The function f(a) and g(a) are then 
given by 


2 1- 
14 arctanh — r<l 
1-2? «of (1 = x?) 1+ 
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g(e) =n (S) +491 =e (57) 


2.3.3. Non-circular lenses 


There are no exactly circular lenses in nature except a star, which we do not treat 
here. Any lensing object is somehow non-circular. Usually rather than giving the 
explicit distribution for the surface mass density or 3-dimensional mass density, the 
gravitational potential is given. Several interesting models exist for non-circular 
potentials. As far as an elliptical potential is concerned, the general model contains 
at least five parameters (an indicator of the strength of the lens such as the Einstein 
angle in the circular case, core radius, ellipticity, the power law index of density 
falloff and the angle of the major axis from a fixed direction). For example, the 
Tilted Plummer Elliptical potential is given by 


2 n/2 
jan=e (23) | (58) 


Uf OF 


where rz, = x?(1—e€))+y?(1+€p), wp is related to the core radius, €p is the ellip- 
ticity and 7 is the power law index. 7 = 1 corresponds to the isothermal case. The 
reader interested in this model should consult the paper by Grogin and Narayan.*? 

Although one needs a specific potential for model fitting, general properties 
of elliptical models are similar for any known model. Figure 9 shows small circular 
sources in the source plane and the corresponding images in the image plane. In the 
elliptical case, there exist inner caustic and inner critical curves. The inner caustics 
have points where the derivative along the curve is not well defined. Such a point is 
called a cusp. Five images exist when the source is inside the inner caustic. Three 
images exist when the source is between the inner and outer caustics. As seen in 
Fig. 9, when a sufficiently small finite source approaches the cusp from the inside, 
three images (two images parallel along and outside the caustics, one perpendicular 
to and inside the caustics) approach a corresponding point on the critical curve 
and vanish when the source passes through the cusp. When the source approaches 
a cusp from the outside, the opposite phenomenon occurs and three images seem 
to appear from the corresponding point on the inner critical curve. See Bland- 
ford and Kochanek®? for a more systematic understanding of caustic and critical 
curves. 
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Fig. 9. The caustics and critical curves of an elliptical lens with sufficiently small core. The 
upper and bottom panels are the source plane and image plane, respectively. The sources are 
circular with same size. The corresponding (same color with source) image positions and shapes 
are shown. The left part of figures show the case when the source is close to inner fold caustics. 
On the other hand, the right part of figures show the case when the source is close to inner cusp 
caustics (Hattori et al.5*). (For color version, see page II-CP12.) 


3. Strong Lensing 
3.1. Methods of solving the lens equation: LTM and non-LTM 


In strong gravitational lensing, there are a number of observables such as the relative 
positions of images, relative fluxes of images and time delays of luminosity changes 
between lensed images. These observables depend not only on the mass distribu- 
tion of the source but also on the cosmological parameters. There are essentially 
two methods for constructing mass models of the source. The first is called the 
parametric method, which assumes a simple but physically reasonable model of 
potential such as an isothermal sphere, NFW model, elliptical pseudo-isothermal 
model and so on with parameters that have clear physical meanings. Physically 
reasonable means that one infers the shape of the lensing potential from the image 
position as well as the lens position and shape assuming the existence of a strong 
correlation between light and mass. Thus, this method is now called the light traces 
mass (LTM) method. In the case of a galaxy lens, the relative position of the images 
and the existence of arcs gives us enough information to determine the form of the 
potential. 
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The other method is called the non-parametric method, which is also referred 
to as the “grid-based” method or non-LTM method because one does not assume 
the potential shape from the beginning. This method has become both possible 
and popular since deep observations using 8-10 m telescopes and the HST reveal 
detailed shape information of images for galaxy lenses, and a multitude of arcs and 
images for cluster lenses. Such a wealth of information allows us to establish the 
grid by grid correspondence between the image plane and the source plane. The 
basic principle of this method is the conservation of the surface brightness. Namely, 
the surface brightness of two grids related by the lens equation is the same. 

We will not go into further details of these two methods, partly because there 
are many strong lens model software codes using both LTM and non-LTM methods, 
which are publicly available. Instead we shall focus on the basic properties of strong 
gravitational lensing. For more detailed treatment of strong lensing, see an extensive 
review by Kochanek.°° 


3.2. Image magnification 


In strong lensing with multiple images, the ratio of the luminosity between images 
is an important observable and is used to constrain the mass distribution on the 
local scale inside the lensing galaxy. 

To derive the flux ratio between images, it should be noted that gravitational 
lensing does not change the surface brightness. Therefore, the magnification of the 
image compared with the source is just the ratio of the size between the image and 
source, and thus the magnification of the image is the inverse of the determinant 
of the Jacobian. We also define the magnification matrix as the inverse matrix of 
the Jacobian 


Maa“, (59) 


The magnification of an image is not known because we do not know the lumi- 
nosity of the source. However since there are multiple images, the ratio of magnifi- 
cation is observable and is an important constraint of the mass model of the lensing 
object. Suppose we have two images A and B, then using the magnification matrix, 
we have 


603 = M(0z)i358 = M(Op)M~'(04)604 = MpadOa. (60) 

The flux ratio is very useful to constrain the lens model. Although a simple 
smooth lens potential is enough to predict image positions consistent with observa- 
tion, this is not the case for the flux ratio between images. This may be regarded as 


an indication of the existence of dark matter substructure in the halo of the lensing 
galaxy.°©°? 


3.3. Time delays 


Another possible observable in strong lensing is time difference (or time delay) 
between arrival of light rays in multiple images. (Refsdal®). Since the photons from 
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the source travel along different paths to make the different images, the time taken 
by the photon from the source to the observer differs from image to image, result- 
ing in an observable time delay. Time delays have been observed for a number 
of systems, and were used as a method to measure the global Hubble parameter 
before WMAP measured the Hubble parameter to within several percent accuracy. 
The method is free from any empirical assumptions associated with the method of 
the distance ladder. However this method was actually not very powerful in the 
past because of poor time delay measurements, resulting from simplified and inac- 
curate assumptions about the lens mass distribution. Nonetheless, this method is 
potentially very useful if we have enough information to determine the lens mass 
distribution accurately enough. It has been shown that a single gravitational lens 
with well-measured time delays and a detailed lens model can be used to measure 
time delay distances defined as Da = (1 + z,)DrDs/Drzs (see below) to 6-7% 
total uncertainty (random and systematic Suyu et al.°°). In fact, Suyu et al.°? 
have shown that in quadruple lens systems the lens model inaccuracy can be less 
important than the inaccuracy in the time delay measurements. Their result is 
based on detailed modelling of the gravitational lens RXJ1131-1231 with the spa- 
tially extended Einstein ring observed by the HST and well-measured time delays 
between its multiple images. 

On the order of 100 new quadruple lensed quasars are expected to be discov- 
ered in the near future with imaging surveys such as the Subaru Hyper Supreme- 
Cam Survey, Pan-STARRS-1,°° the Dark Energy Survey (DES).®! When their time 
delay data are available, it is expected that together with supernovae and cosmic 
microwave background information, we can improve the dark energy figure of merit 
by almost a factor of 5, and determine the matter density parameter Q,,9 to 0.004, 
the Hubble parameter Ho to 0.7%, and the dark energy equation of state time 
variation parameter wa to +0.26, systematics permitting (Linder®?). 

Here we explain the basic theory of time delays. If a source produces a short 
burst of light, then the photons of the burst will arrive at a time 


1+2zz,dzrdg |1 
a — 
Hy drs |2 


(9 — 8)? — ¥(8)], (61) 


where dx is the angular diameter distance in unit of c/Ho. The first term is just 
the result of the increase in the path length and the second term is the result of a 
decrease in the speed of light near the gravitating object. 

The time delay between the two images A and B is given by 


AaB = TA —TB 


_ 142, dzrdg | 1 
- Ho drs |2 


(84 — 8)? — (On — B)?) — (W(8a) — ¥(8z))]. (62) 


For practical application of time delays, we have to take into account the fact 
that the lens galaxy is sometimes a member of a small group of galaxies or is 
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under the influence of nearby galaxies. In that case, we consider these effects by 
the external potential in the following form: 


Wel) ~ we(0) + Vure(0) + + 5e(0) (62 + 62) 


1 
+ 3 71c(0) (a a 0°) a Y2c(0)Ax Ay, (63) 


where we take the origin at the position of the lens galaxy and assume that the 
length scale of the external field is large compared with the separation of two images 
and thus neglect the higher order terms. The coefficients are given by 


1 
9 Vena _ We aja) V2 = Weny: (64) 


It is easy to show that the first two terms of the above expansion does not change any 
observables in the strong lensing system. The third term expresses the contribution 
from the uniform sheet. It does have any effect on the time delay. 

We now show that a uniform sheet will bring an ambiguity into the Hubble 
constant determination. We consider the transformation of the potential 


1 
Ke = 9 Vea aR We,yy)s Ve = 


b> (L— ew + 5(62 + 63) (65) 


with a constant c. This changes the bending angle as follows: 


a—(l-c)ja+cé. (66) 


The lensing equation tells us that 9 does not change by changing the source position 
3 — (1— Cc). This will change the magnification matrix as M — M/(1-—c), but 
the relative magnification matrix Mp4 = M pM;' does not change. However the 
time delay equation (62) tells that the combination Hp ATg4 changes as above 


Ho ATBA = (1 = c)HoAtgBa. (67) 


This means that the observed time delay does not change if we can change the 
distance scale at the same time as 


Ao = (1 = c) Ao. (68) 


Thus even if we determine the lens model by image positions, as well as the 
ratio of relative magnification between images and time delay, there remains an 
ambiguity in the measurement of Hubble parameter 


Ho = 100h(1 — x.(0))  [km/s/Mpc]. (69) 


The ambiguity «,(0) is the projected surface mass density near the lens position. 
Weak lensing analysis in the region around the lens galaxy is a direct method to 
determine «,. In fact there have been attempts to measure «-(0) in QS00957+561 
A, B by weak lensing.®?:*4 
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3.4. Comparison of lens model software 


Analysis of strong gravitational lensing relies on a software analysis of observa- 
tional data. There have been many lens model software codes available and thus 
it is important to have a systematic comparison between them. A number of soft- 
ware codes used in research studies are not publicly available. We will review here 
several of the publicly available codes that have been used in previous studies. A 
comprehensive review is available in Lefor, Futamase and Akhlaghi.®° 


3.4.1. Non-light traces mass software 


These codes were formerly referred to as non-parametric, and include: 


e PixeLens: This program uses a non-parametric model with Bayesian statistics. 
Inputs to the program consist of model constants (redshifts, pixel size, etc.) and 
image data. The radius of the mass map in pixels and the redshifts for the lens and 
source must be given. The code reconstructs a pixellated mass map by generating 
large ensembles of models with a Metropolis algorithm (Saha and Williams). 

e Lensview: Lensview is based on the LensMEM algorithm, which finds the best 
fitting lens model and source brightness using a maximum entropy constraint. 
Lensview is used to study lens inversion, obtaining a model of the source based 
on lensed images. Due to its comprehensive nature, it is possible to specify very 
complicated lens models based on one or more components (Wayth and Web- 
ster®’). 

e LensPerfect: LensPerfect uses a parametric model but is also model-free as 
described by its developers, who further characterize it as non-LTM. LensPerfect 
solutions are given as sums of basis functions. While most parametric models are 
based on a physical object, the basis functions used by LensPerfect have no phys- 
ical interpretation. Input to the program is via a text file and graphical output 
is shown immediately on the display (Coe et al.®8). 

e GRALE: GRALE can be used to simulate gravitational lenses and to invert 
lensing systems. The GRALE algorithm uses a non-parametric technique to infer 
the mass distribution of a gravitational lens system with multiple strong lensed 
systems (Liesenborgs et al.®°). 


3.4.2. Light traces mass software 
These codes were formerly referred to as parametric codes, and include: 


e Lenstool: Lenstool was first described in 1993 and has been continually upgraded. 
The current version uses a Markov Chain Monte Carlo sampler to avoid local 
minima in likelihood functions. The software uses a text file for data input and 
describes the model using a wide array of parameters. Multiple lens potentials 
are supported, all at the same redshift. Lenstool is a comprehensive program for 
gravitational lens modeling (Jullo et al.?°). 
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e Glafic: Glafic allows both point and extended sources in the analysis of lensed 
images as well as a large catalog of lens potentials. A text file is used to describe 
the model and image data as well as priors. Glafic can accurately recover lens 
model parameters of known lensing systems (Oguri”°). 

e Gravlens/Lensmodel: This software has a large catalog of mass models and uses 
a text file for commands and data input. The software uses a tiling algorithm 
to determine the number and position of lensed images associated with a given 


source, and uses a polar grid centered on the main galaxy (Keeton"?). 


This list of software is by no means comprehensive but provides a list of 
some of the commonly used software for strong gravitational lens models avail- 
able for public download. All the lens model software described here is available 
directly from, or using links provided by, the Astrophysical Source Code Library 
(http: //www.ascl.net). 


3.5. Lens statistics 


The statistics of strong lensing events have interesting applications to cosmology. 
There have been many studies of statistics using quasars, giant luminous arcs and 
recently GRAMORs. It has attracted much attention because the statistics are 
sensitive to the cosmological constant via the cosmological volume element (e.g. 
Turner”?; Fukugita, Futamase and Kasai®°). As shown below, a detailed and accu- 
rate knowledge of the properties of the source and lensing object is necessary to use 
lens statistics as a useful method to measure the cosmological parameters and the 
redshift evolution of the lensing galaxies (e.g. Kochanek”?; Ofek et al.’4; Chae and 
Mao”; Matsumoto and Futamase”; Chae’’; Cao and Zhu®?). Furthermore, it has 
been shown that sample incompleteness can bias the results significantly (Capelo 
and Natarajan"’). Recent large scale galaxy surveys such as SDSS overcame these 
difficulties and made the statistics a useful tool in observational cosmology (see 
Oguri et al.”). 

Here we explain the basic formulation of quasar statistics as an example of lens 
statistics. Consider the probability that a quasar with redshift z,; is lensed by a 
lensing object with redshift range zp ~ zz, + dz,. Writing the cross-section as oy, 
the differential probability may be written as follows. We have 


dr(zi) = n(zL)oL 


cdt 

—|dzy, 70 
dzp | “ ( ) 
where n(zz,) is the number density of lensing objects. The cross-section is given by 
solving the lens equation once we choose an appropriate lens potential, and depends 


on the redshifts of the source and the lens as well as the relative position of the 
source and lens. 


es i Pa) (71) 


p®(L) ’ 
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where the integration region S$ in the source plane is over the position where multiple 
images are produced. Here we include the magnification bias. This is necessary 
because some observed lensed quasars are magnified and become brighter than a 
limited magnitude. Thus we need a magnification factor for each source position 
and the quasar luminosity function ®(L). The cosmological dependence is in the 
quantity cdt/dzz, which is calculated in FRW geometry to be 


cdt c 1 
dz, -Ho(1+ 21) /Omo + 2z)2> + 1 — Om — Qao)A +2)? +90 


(72) 


where Q,,,9 is the matter density parameter and Q,,9 = A/ 5H is the normalized 
cosmological constant. 

In practical applications, the number density of the lens object is calculated 
using the luminosity function ®(L) = dn(L)/L or the velocity function w(o,) = 
dn(oy)/doy,. For example, using the velocity function the differential probability 
of lensing per unit redshift with an image separation between @ and 0 + dA@ is 
given by 


ar 28 
dAédz, =| doyb(ov)(1 + zz) dz, dA@’ (73) 


A recent study based on the lens sample from the Sloan Digital Sky Survey 
Quasar Lens Search (SQLS) (Oguri et al.”) used 19 lensed quasars selected from 
50,836 source quasars where the velocity dispersion of the lensing galaxies sug- 
gested by SDSS data (Bernardi et al.8°) and the quasar luminosity function for 
0.4 < z < 2.3 measured by the combined analysis of SDSS and 2dF data (Croom 
et al.®') are used. It is shown that the SQLS sample constrains the cosmologi- 
cal constant to Qa. = 0.79¢0+°(stat.)dG0o(syst.) for a flat universe. It is also 
shown that the dark energy equation of state parameter is consistent with w = 1 
when the SQLS data is combined with constraints from baryon acoustic oscillation 
(BAO) measurements or results from the Wilkinson Microwave Anisotropy Probe 
(WMAP). Furthermore, no redshift evolution of the galaxy velocity function at 
z <1 is found. 


4. Weak Lensing 


Although strong lensing is an important and spectacular phenomenon, it is a rare 
event and most light rays from distant sources propagate through relatively weak 
gravitational fields such as present in the outskirts of clusters and regions sourced 
by the large scale structure. In this case the intervening gravitational fields only 
slightly distort the shape of background sources, resulting in a systematic distortion 
pattern of background source images known as weak gravitational lensing. In the 
past decade, weak lensing has become a powerful, reliable method to measure the 
distribution of matter in clusters, dominated by invisible dark matter (DM), without 
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assumptions concerning the physical and dynamical state of the system.®? ®! Not 
only clusters, but also weak lensing due to large scale structure, attracts much 
attention because it carries information regarding the growth of structure which is 
affected by the nature of dark energy. For a general review of weak gravitational 
lensing, see Bartelmann and Schneider®? and Umetsu.??}°4 


4.1. Basic method 


We shall explain one method of mass reconstruction using weak lensing mass obser- 
vations in some detail. For clarity of the argument we treat the simplest situation 
in this subsection. The complexities are mentioned later. 

First we select a sample of background galaxies for each lensing object consid- 
ered. Since the lensing signal depends on the redshifts of the sources, it is necessary 
to have redshift information for accurate determination of the mass distribution 
of the lensing object. For example, in a weak lensing analysis of galaxy clusters, 
the contamination by faint member galaxies dilutes the lensing signal, in particular 
in cluster central regions.®° In the case of cluster lensing, the background galaxies 
are selected by color information. Namely, their colors are redder than the color- 
magnitude sequence of cluster member galaxies due to large k-corrections (for more 
detail, see Medezinski et al.,°° Okabe et al.*4 However, more detailed redshift infor- 
mation regarding the background galaxies is necessary for accurate determination 
of cosmic shear (see below). 

Once we define the sample of background objects, the next step is to mea- 
sure the shape of each background object. Several shape measurement schemes 
have been developed. We concentrate here on the so-called moment method, which 
uses multipole moments of brightness distribution to characterize the shape of the 
background objects (KSB®? and Okura and Futamase®’). Although this is not the 
optimal method as seen in Fig. 10 below, it is still the most commonly used method. 

In the moment method we measure the quadrupole moments of surface bright- 
ness for each background object 


obs 
Se Fi @?0A0;A0;1(8), (74) 
where J(0@) is the surface brightness distribution and AO; = 0; — 00, with 00,; 
the center of the image. Below we define the center as the point at which the 
dipole moment vanishes. Then we define a purely spin-2 quantity from the measured 
quadrupole moments 


obs obs obs 
aC a 
obs obs) ” obs obs 
air + af’ of +08” 


which is called the ellipticity. For another definition of ellipticity, see Bartelmann 
and Schneider.®? Similarly we define the quadrupole moments G and ellipticity 
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eS) for each of the corresponding sources which are not observable. Using the lens 

P g g 
equation we find the relationship between the ellipticity of the observed image and 
of the intrinsic source 


om (obs) _ 29 oe gent) 
1 + |g|? — 2Re(ge(bs) *) ° 


(76) 
In the weak lensing limit 
ght? ew g OPE) _ ig, (77) 


Since we can expect that the intrinsic ellipticities of nearby galaxies on the 
sky do not correlate with each other, we can average over a certain number of 
background galaxies to obtain 


obs, | Oe 
(€ )~29 +0 (5), (78) 
where o, ~ 0.3 —0.4 is the dispersion of the intrinsic ellipticity of galaxies, and N 
is the number of background galaxies averaged over. The more galaxies we average 
over, the more accurate the estimation of the gravitational shear we have. In the 
best seeing condition at the summit of Mauna Kea we may have N > 50 per 
arcmin”. In a realistic situation, the number is less than 30 or so. In the case of 
cluster lensing, the lensing signal is on the order of 0.1 and thus we can safely detect 
the lensing signal by clusters. 

Once we have the averaged shear field we can convert it to a convergence map 
which is the normalized surface mass density using the relationship between the 
convergence and the shear. Although this is the essence of weak lensing mass recon- 
struction, things are much more complicated for real observations. 
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Fig. 10. Stellar ellipticity distributions before and after PSF anisotropy correction. The left panel 
shows the observed ellipticity components (e},e4) before PSF correction, and the right panel 
shows the residual ellipticity components (de], 6e3) after PSF anisotropy correction (Umetsu and 
Broadhurst?°"), 
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4.1.1. Shape measurements 


In practical observations of weak lensing, the observed image is not the lensed image 
but rather the image smeared by various sources of noise such as the distortion by 
atmospheric turbulence, imperfect optics, pixelization by the CCD chip, and photon 
noise. Thus we have to correct these effects before applying a mass reconstruction 
method. 

It is supposed that these effects are expressed by the point spread function 
(PSF) P(@) as follows: 


ros) (9) = / PbT(p) P(O — ). (79) 


In order to improve the accuracy of weak lensing analysis, we need to reduce 
various sources of noise and to correct the PSF effects without introducing other 
errors. 

In this review we focus our attention on PSF correction in some detail because 
this has the most important effect in weak lensing analysis. There are several meth- 
ods for PSF correction available today. Here we employ the moment-based approach 
(see Kaiser, Squires and Broadhurst®*; hereafter KSB). In realistic observations, 
since the observed shapes of background galaxies are rather noisy, particularly for 
small faint galaxies, we need to introduce a weight function W to reduce the random 
noise in the outer part of the image, so that 


obs obs 0 
| _ 00.051! >) (@)(@)W (=) ; (80) 
where g is a typical scale for the galaxy. The simple choice is the spherical Gaussian 


shape: 


W (2) wert, (81) 


on 


It is known that this choice introduces a systematic error in measuring the shape 


that overestimates the shear for images with large ellipticities. The error is overcome 


by introducing an elliptical window function (see Okura and Futamase®> 1°), 


The PSF correction proceeds in three steps. First we decompose the PSF func- 
tion into isotropic and anisotropic part as follows: 


P(0) = / d?6'q(@') P=) (0 — 6’), (82) 
where g, P“®°) are normalized and have vanishing 1-order moments. 


/ déq(0) = / doP)(@) = 1, (83) 


f d00;q(0) = / d00; P=) (8) = 0. (84) 
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(i) Anisotropic PSF correction 
We define 


1(69)(9) = [ a?6'1(6')P°™)(6 — 6) (85) 
Then the observed intensity may be written as follows: 
10)(9) = [ ab’ q(o — 6')1(6") (86) 


Assuming a nearly spherical shape of the anisotropic PSF function, we only 
consider the quadrupole moment of the anisotropic PSF function 


aij = | €04(8) a (87) 


Then the quadrupole moment Q 8°) calculated from I“S°) is related to the 
observed quadrupole moment as follows: 


1 
QE) = QO) — 5 Zijnedee, (88) 


where qi = qi1 — 922,92 = 2q12 and 


29 7 (obs) oP |a/? 
J 


i 
Using the definition of ellipticity and the above expression, we find the ellipticity 
corrected by the anisotropic PSF effect is 


els) = efoPs) _ pamg,, (99) 


where P*™ is called the smear polarizability which expresses the response of the 
ellipticity against the anisotropic PSF effect (a detailed expression may be found 
in the original paper by KSB and the review by Bartelmann and Schneider). From 
dimensional arguments P*™ « rg (where rg is the Gaussian scale of the object), 
and thus the PSF anisotropic effect becomes important for objects with small S/N. 
The above correction requires a knowledge of q; which is given by applying 
the equation to stars. Since stars do not suffer lensing and their images have no 

ellipticity after isotropic smearing, e 8°) = 0 and thus we have 
gh = (Prem yieg (91) 


a 


The quantities with an asterisk denote quantities associated with stars. To 
obtain the smooth map of q; used in Eq. (97), the co-added mosaic image is divided 
into small chunks with scales determined by the typical coherent scale of PSF 
anisotropy patterns. By doing this, PSF anisotropy in individual chunks is well 
described by fairly low-order polynomials. Typically the rms value of stellar ellip- 
ticities is reduced from a few % to (4—8) x 1073 after anisotropic PSF correction. 
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Figure 10 shows the distribution of stellar ellipticity before and after the PSF 
anisotropy correction using the Subaru data for the cluster Abell 1689 (Umetsu 
and Broadhurst?°"). 


(ii) Isotropic PSF correction 

Next, we have to correct for magnification due to the PSF effect. For this purpose 
we consider the intensity [ (3) in the source plane corresponding to I“*°) (6). If we 
write the lensing equation as B = A@, then we can write 


1@80) (9) = [(A@). (92) 


The inverse relationship may be written by introducing the PSF in the source 
plane and may be written as 


(8) = [ @s'r(6')P(8 - 6, (93) 
where we have introduced the PSF in the source plane corresponding to the isotropic 
part of the PSF in the image plane, so that 

A 1 


P(g) = a (A-*@). (94) 


This function is normalized and has a zero mean. Note that P does have an 
anisotropic part from the lensing effect. Thus we decompose into isotropic and 
anisotropic parts, so that 


pp) = | #9 PBB - 6’) (95) 
Then the same argument in the image plane applies here to give 
é) = é; — P2"G;,, (96) 
where €° is the ellipticity of the intensity [° defined by 
(8) = [ @'7q( - 8°), (97) 


We have used [‘°s) instead of f in P*”" as before. 
The final step is to relate the artificial ellipticity é° to the isotropically smeared 
ellipticity. This is simply obtained by using the relationship between JS°) and 


I, which is defined above. Using this relationship, the quadrupole moment of the 
artificial source I is calculated to be 


Qy = [e93.9;,Ae)W (2) (98) 


|a/? = opal 


o2 


= (det A) Ain, Aje / on.onr=(ayW ( (99) 
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where 6? = (1—«)?(1+|g|?)o? and 6; = 2g;/(1+|g|?). Under the assumption that 
g is small, we can replace I*°) in the observed image J‘°PS), and then by expanding 
the window function in the first order of 6;, we obtain 


ef) = + Pah gy, (100) 
where P*” is called shear polarizability and expresses the response of the ellipticity 
against the shear. The detailed expression may be found in the original paper which 
describes KSB. 

Combining these two relationships, we have 


ep) = 6) +f Pegg + PHP Gy, (101) 
By using star images, we have 
a *,SM\— *, sh 
Gi = —(P* ee st Ij: (102) 


Finally, by using the relationship between e“*®°) and the observed ellipticity 
e(°s) | we finally have 


é° = eh) _ pag, — PS g,, (103) 
where we have defined 
s sm *,8M)\— *,sh 
Pe = A =F Vee Pes : (104) 


Since the intrinsic ellipticity of background galaxies do not have a correlation, 
averaging gives us 


(g:) = (PNG te” — PF" a), (105) 


with q = Gam 
As seen from the above argument, one has to make some assumptions such as a 
small anisotropic part q and the use of the observed image J‘°S) in the evaluation of 
smear polarizability and shear polarizability. These assumptions caused systematic 
errors in the original KSB weak lensing analysis. There have been some proposals 
to improve the accuracy of the analysis. Erben et al. have proposed using the trace 
part in PY to reduce the noise which creates non-diagonal components in P9. 
P49 


Sty 


= 5Ir(P 6 = Poop: (106) 


(Hudson et al.!°?; Hoekstra et al.!°%; Erben et al.!°4; Hetterscheidt et al.!°°). The 
diagonal parts are measured for individual galaxies, but they are very noisy for 
small and faint galaxies. To avoid the noisy part, an averaging in object-parameter 
space was introduced by Van Waerbeke et al.3°; see also Erben et al.!; 
et al.1°6 


Hamana 
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Although in the KSB method the result in principle should not depend on the 
choice of the scale in the weight function, this is not the case for real observations. 
Hoekstra et al.'°3 have shown that the scale used in the weight function of galaxies 
gives a better result. 

In practice, the averaged shear over the observed region is estimated by intro- 
ducing the weight function u,,; such that 


1 


=. 107 
oor ae a2 ? ( ) 


oe ao 
where J labels the object in the region, the constant a is the softening parameter and 
oy is the rms error of the complex distortion measurement. Usually a ~ (03) ~ 0.4 
is used, which is a typical value of the mean rms o, over the background sample. 
Then the averaged shear is given by 


(gs(8)) = Leo Or tt 

do g(O — Or )ur 
where gj,7 is the reduced shear estimated of the /th galaxy at an angular position 07, 
and w,(0) « exp(—6?/0?) with 6, = FWHM/¥V41n2 which pixelizes the distortion 
data into a regular grid of pixels. The error variance for the above smoothed shear 
is given as 


(108) 


2 2 2 
> wg 1g TOLL 
ete | 


0%, (0) = $y, 
(= ono) 
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where Wg,7 = Wg(9 — 87) and the relationship (gi,r9;,7) = (1/2)0% 76i3617 has been 
used. 

The smoothing scale is chosen to optimize the weak lensing detection of target 
mass structures, depending on both the size of the structure and the strength of 
noise power (x a7 /ng). 

Once the smoothed shear field is thus obtained, it can be converted to the 
convergence field using the relation K(k) = 7D*(k)y(k). Actually this inversion 
method has to be applied in an infinite space. For a finite space one need to use 
the finite-field solution of the inversion problem for the reconstruction kernel. Such 
method has been developed by Seitz and Schneider.!® For a sufficiently wide field 
where the data field is not dominated by a positively or negatively biased density 
field, both methods give the almost same convergence field. This is not the case for 
a nearby cluster for which we have to use a finite field method. 


(109) 


4.2. E/B decomposition 


Observations and data analysis suffer from errors and noise. In particular, weak 
lensing analysis needs many corrections such as for the PSF distortion to isolate 
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a physical signal of a weak tidal gravitational field. If the corrections are not per- 
fect, they may systematically change the signal. Thus it is critically important to 
characterize the effect of systematic errors on the final results. 

Here we consider a method for the evaluation of systematic errors called E/B 
decomposition. This method is based on the fact that gravitational shear is gener- 
ated by a Newtonian potential in the lowest order, and thus the gravitational field 
is rotation-free. 

Let us first define a new 2D vector field u from the reconstructed convergence 
k by 


u=VK= eae (110) 
21 — J,2 
Using the vector field, we define the F-mode « and B-mode xk as follows: 
Ve” = V ou, (111) 
V«nF =Vxu= u2,1 — U1,2; (112) 
which gives both E-mode and B-mode potentials 
Vor aay. (113) 


Note that the gravitational effect is given by the Newtonian potential WU in the 
lowest order so that #6” = W and ¢? = 0, but in practical observations, 6” 4 V and 
¢? #0. The complex shear may be expressed by the complex potential ¢ = dg+idp 
as follows: 


1 1 
v= |5 (0% - 0%) - 0%] +i[o% +5 (6 -0)] 4 


This expression shows that the transformation 7/(@) = i7(@) is equivalent to 
an interchange of the E- and B-modes of the original maps. By noticing that the 
shear transforms as y/ = ye?"? under the rotation of an angle ¢, this operation is a 
rotation of each ellipticity by 7/4 with each potential being fixed. Figure 11 shows 
the E and B mode shear. 

There are several sources of B-mode shear. One artificial source is PSF residu- 
als. Since the PSF correction so far proposed adopts some sort of approximation, it 
gives rise to B-mode shear. In addition to artificial B-modes, there are also phys- 
ical B-modes. Intrinsic ellipticities of background galaxies contribute to the shear 
estimate. By assuming randomness of orientation of the intrinsic ellipticities, such 
uncorrelated ellipticities yield statistically identical contributions to the E- and 
B-modes. 

By increasing the number density of background galaxies, these modes may be 
reduced. Another physical B-mode is due to the existence of intrinsic alignments, 
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Fig. 11. Shear patterns of E- and B-modes (Van Waerbeke and Mellier!°°). 


which may be generated by tidal forces between halos of galaxies. There are two 
kinds of intrinsic alignments. One is called II correlation. Galaxies around a large 
halo may have some correlation of the orientation of ellipticity by a tidal gravita- 
tional force. This effect creates both E- and B-modes. It is reasonable to assume 
that the effect only applies to nearby galaxies, and thus the effect is eliminated 
by excluding nearby galaxy pairs from the background sample.!°? Another effect 
is called GI correlation, which is the correlation between the intrinsic ellipticities 
and large scale structure.!!° This is because the ellipticites are influenced by a local 
density field and the same density field causes weak lensing for high-z galaxies (cos- 
mic shear). Thus the B-mode signals serve as a useful null check for systematic 
effects. 

Finally we highlight an ambiguity in weak lensing analyses. As pointed out 
above, the relationship between the convergence and the shear is not unique. Any 
constant mass sheet does not change the shear field which is obtained by the weak 
lensing limit. This is called the mass-sheet degeneracy. Actually as shown above, 
the observable is not shear itself but the reduced shear 


g(8) = eh (115) 


This field is invariant under the following global transformation: 


K(O) > AK(9) +1—A,  7(A) > AY(4), (116) 


where A ¥ 0 is an arbitrary constant. There is a method to break this degeneracy 
once one realizes that the above transformation is equivalent to the following scaling 
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for the Jacobian matrix: 
A(0@) — AA(8). (117) 


One can exploit magnification effects since the inverse of the determinant of the 
Jacobian is the magnification. 


4.3. Magnification bias 


Recent advances in weak lensing observations made the measurement of the magnifi- 
cation bias possible by measuring the depletion of the surface density of background 
objects. There are two effects as a result of lensing magnification. One is the expan- 
sion of the area of sky. The other is the amplification of the apparent magnitude 
which is given by 

ms — m+ 2.5 log p, (118) 
where m is the apparent magnitude without lensing. These effects change the 
magnitude-limited number density n(< mim) of the background objects, so that 


1 
n(< Mim) = ue Mim + 2.5 log 11), (119) 


where mim is the limiting magnitude and ng is the unlensed number density count 
per solid angler. Let us suppose that no near the limited magnitude obeys a power 
law, 


dlog no(< m) 
Q = 


; 120 
dm . 20) 
where a is the power law index around m = Mrmjim. Then we have 

n(< Mim) = no(< jimiee (121) 
Since ps > 1, n(< mim) < No(< Mim) if a < 0.4. The ratio is 

n < Mim) 2.5a—1 

p= — iw P01 1 4 (a — 9), 122 
oe ( ) (122) 


where we have used the weak lensing limit in the final step. 

For red background galaxies at a median redshift ~ 1, the intrinsic power law 
index at faint magnitudes is observed to be relatively flat a ~ 0.1 and thus causes 
depletion of the number count. This depletion has been observed in several massive 
clusters (Broadhurst et al.9°; Umetsu and Broadhurst!®!; Umetsu et al."*). 


4.3.1. Simulation test 


Although many improvements in weak lensing analysis have taken place, systematic 
errors in shape measurement must be reduced by a factor of 5-10 by the time 
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the next generation large scale galaxy surveys are conducted. It is important to 
test the accuracy of shear measurement methods. This is done by using simulated 
data with realistic complicated galaxies and PSF effects such as the GREATO08 
and GREAT10 challenges (Bridle et al.11?113; Kitching et al.114:116119) and STEP 
(Heymans et al.!!"; Massey et al.18). 

The accuracy of these methods is measured by the two parameters m and c, 
defined by 


ane = Sel = myirue +e (123) 
where m is called the calibration bias and c is called PSF residual. Figure 12 shows 
the results using various methods of weak lensing analysis.!!9 


4.3.2. Higher-order weak lensing-flexion and HOLICs 


In the usual treatment of weak lensing, the quadrupole moments of the shape 
are measured to evaluate the ellipticity which are spin-2 quantities. However, the 
quadrupole moments are not enough to characterize the shape of galaxies in general. 
In fact, there are many background galaxies, in particular small faint galaxies with 
shapes not well described by simple ellipses. 

It would be very hard to describe general complicated shapes mathematically, 
but if the shape deviates slightly from an ellipse, higher-order moments such as the 
octopole moments may be useful. Remembering that the spin-2 combination is con- 
structed by a combination of the quadrupole moments, the quantities with definite 
values of spin should be constructed by appropriate combinations of higher order 
moments (Okura, Umetsu and Futamase!?'). Such quantities are called Higher 
Order Lensing Image’s Characteristics (HOLICs). Here we concentrate on the octo- 
HOLICs constructed from octopole moments. 


i, -5 
x10 x10 
1 5 
+ KSB 
0.5 4 2.5 * method 4 
& a x im3shape NBCO 
dl , © DEIMOS C6 
< KSB f90 
a 2 + ee 207 x ARES 50/50 
> 4d <J_NN23 func 
[> MegaLUTsim2.1 b20 
-0.5 -2.5 x  gfit 
* shapefit 
* 
+1 5 
-0.2 -01 0 O01 02 -0.05 -0.03 -0.0100.01 0.03 0.05 
m m 


Fig. 12. Left: The multiplicative m and additive c biases of various measurement methods are 
shown. The symbols indicate the method (shown in the right panel). The central figure expands 
the x- and y-axes to show the best performing methods. The figures are cited from Kitching 
et al.119 The detailed description and references of the methods are also found there. 
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The following quantities are the octo-HOLICs: 
(Qita + Qiz2) + 1(Qi12 + Q222) 
g 
Qit1t — 3Q122) + i(3Q112 — Q222) 
€ ) 
where Qijx and Qijxe are the octopole and 16th pole moments, respectively and € 
is the normalization factor defined as 


¢= ; (124) 


ge) 


(125) 


€ = Qii + 2Q1122 + Qo222. (126) 


The first HOLICs ¢ has spin-1, and the second HOLICs 6 has spin-3. These 
HOLICs have dimensions of [angle]~!. As the spin-2 ellipticity is related to gravita- 
tional shear which is the second derivative of the lens potential, the above HOLICs 
are related to the third derivative of the lens potential which is called flexion (Gold- 
berg and Bacon, 2005; Bacon et al., 2006). Higher-order shape change is calculated 
as 


1 
ade; = Aj;d0; + 3 Pind dO + O(6°), (127) 


where Dijk = Aij,n = —Wijx (Bacon, Rowe & Taylo, 2005). Using this equation, we 
can show a relationship between the intrinsic HOLICs and the observed HOLICs. 
By linearizing the relationship we have 


9 F 
CS) eu Fe ™ 
(a ¢-F oe, (128) 
3.°G 
(he 12 
é6 é6 = (129) 
where F' and G are defined as 
F=0K, G=0%, (130) 


where 0 = 0, + i02 is the complex gradient operator. Thus F' has spin-1 and is 
called the first flexion. G has spin-3 and is called the second flexion. By averaging 
over the background galaxies, we know that the HOLICs are an estimator of the 
flexion. So far, the spin-1 HOLICs ¢ has been measured in Abell 1689 and used for 
mass reconstruction (Leonard et al.!??; Okura, Umetsu and Futamase!”!; Leonard 
et al.!?8; Cain, Schechter and Bautz!**). 

For recent development and other approach for Flexion analysis, see, for exam- 
ple, Viola, Melchior and Bartelmann,!?° Rowe et al.!2° and Levinson.!?7 


4.4. Cluster mass reconstruction 


Since weak lensing analysis cannot resolve mass structure smaller than a finite scale 
on the order of 1 arcmin, it is applied mainly to clusters of galaxies and to the large 
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scale structure of the universe. Clusters of galaxy are the largest self-gravitating 
objects in the universe, and thus their statistical properties contain indispensable 
informations on the initial density fluctuations and their nonlinear growth. The 
most important such information will be their mass distribution. On the other hand, 
a standard structure formation scenario based on CDM and a cosmological constant 
predicts that larger structures are the result of assembly of smaller structures and 
that the spherically averaged CDM halo is well described by the so-called NFW 
profile.!’° Weak lensing thus plays an important role to study cluster properties. 
In this subsection, we explain frequently used techniques for measuring the cluster 
mass profile and then give two examples of weak lensing observations. For a more 
comprehensive treatment of cluster weak lensing, see the reviews by Umetsu?? and 
by Kneib and Natarajan.!” 
It is convenient to write the critical mass density in the form 


C Des(Zs) 


Neri = ae 2 
s inGD," Blze ° ) Ds(Ze) 


(131) 


where 3(z,) is a function of the source redshift z, with fixed zp and represents the 
geometrical strength of lensing for a source with redshift z,, and thus is called the 
lensing depth. If the source redshift distribution is given by N(z), the mean lensing 
depth is 


[ ex@e@ 


= = (132) 
| dzN(z) 
0 
It is convenient to use the normalized mean lensing depth defined by!”” 
(8) 
= —— +. 133 
we (133) 


The convergence and shear may then be written in terms of the corresponding quan- 
tities for a fiducial source at infinite redshift as «(@) = wk(@) and y(@) = woo (8). 
In this way, the redshift information of background galaxies is taken into account. 
Actually the observable is the reduced shear g = y/(1 — «) which is nonlinear in x. 
The averaging by source redshift is complicated operation. The detailed treatment 
of the averaging is given by Seitz and Schneider!” and by Hoekstra et al.7° 

As explained above, the tangential shear g; is a coordinate independent quan- 
tity for a given reference point. For cluster lensing, the reference point is natu- 
rally the center of the cluster, which can be determined from the symmetry of the 
strong lensing pattern, the X-ray centroid position, or the position of brightest cD 
galaxy. It is also convenient that the cross-shear g, may be used as an estimator 
of the systematic error since gx is the divergence-free, curl-type distortion pattern 
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of background images. The reason to use tangential shear for measuring the mass 
profile is that the azimuthal average of the tangential shear allows us to directly 
compare with the theoretical model by using the relationship 


(7+)(9; 80) = R(< 0,80) — (K)(9, 80), (134) 


where (--) is the azimuthal average and &(< 0,00) is the average of the convergence 
within the radius @. In the following, we understand that @ is the radius from the 
origin 09 and will not write the origin. The above relationship is obtained by the 
definition of & in the following form: 


0 
0°R(< 0) = 2 | d0'0' (K(0)) = a pews. (135) 


where we have used the fact that 24 = Ad, and the 2D version of Gauss’s theorem. 
By taking the derivative with respect to 9 and using the relationship 


ao 
a2 (136) 
we have 
= A) OR(< 0) + O[(«) (0) — (y2)(8)). (137) 


This is the required relationship by noting d(@?K/)d@ = 2(k). 
It is also useful to define the following quantity, over the annulus bounded by @ 
and @out located outside the mass to be measured: 


C(O, Aout) = R(< 0) — (0 < 0 < Oout), (138) 


where @ and Out are the inner and outer radii of the annulus centered on some 
position @9 usually taken at the position of the brightest galaxy in the cluster. This 
is referred to as ¢-statistics (Fahlman et al.!28). Using the above equations, we can 
derive 


Bout 
C(6, Pout) = ay din6(¥,(0)). (139) 


Thus ¢ is directly obtained by observing the image distortion of background 
galaxies in the weak lensing regime where ||, |y| < 1. Since K(0 < @ < Oout) is 
positive definite, R(< 6) > ¢(@, Pour). Thus we can define the lowest projected mass 
contained within the radius @ as 


Me(< 2) _ m(Da9)? Xer€ (0, Dout)- (140) 


By fixing the outer radius ou, sufficiently large (but not so large to contain 
neighboring clusters), ¢(@, Aout) as a function of 6 is called the radial profile. This 
profile does not depend upon the invariance transformation. We can fit the observed 
radial profile with some theoretical models. 
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Using the technique explained above one can observe the mass distribution of 
galaxy clusters. The structure formation scenario based on CDM makes particular 
predictions for the averaged density profile of the main halo and statistical prop- 
erties of dark matter subhalos. Thus the measurement of the mass distribution by 
weak lensing will give us a direct test of the CDM structure formation scenario. 
There are several on-going projects aiming at an accurate measurement of the clus- 
ter mass profile.!?9 133 Here we give two examples of weak lensing observations of 
clusters performed using the Subaru telescope. 

If many lensed images with redshift information are observed in the central 
region of a cluster, a very accurate determination of the mass distribution in the 
central region is possible. Also the magnification bias is observed in some massive 
clusters. One can then combine all information to acquire an accurate model inde- 
pendent determination of mass distribution over a wide range of cluster radii. The 
CRASH survey!” is designed for this purpose, combining high-resolution 16-band 
HST imaging with wide-field Subaru observations for a sample of 20 X-ray selected 
relaxed clusters (Zitrin et al.18°; Coe et al.18'; Umetsu et al. 2012). The Bayesian 
approach to combine this information in a joint mass-profile analysis has been pro- 
posed by Umetsu et al.!82:183 For another approach to combine the strong lensing 
and weak lensing, see Merten et al.!*4 


4.4.1. Density profile 


It has been known that the density profile of a collapsed object is well described 
by the NFW profile regardless of scale, as the result of numerical simulations.” 
Here we show the result of our measurement of the mean density profile by stacking 
the weak lensing signals of approximately 50 mass-selected clusters in the redshift 
range 0.15 < z < 0.3 (Okabe et al.34). These are selected from the ROSAT All Sky 
Survey catalogs (Ebeling et al.13+1%°; Boehringer et al.1%°) that satisfy Lx [0.1 — 
2AkeV]/E(z)*" > 4.2 x 10“ ergs 1,0.15 < z < 0.30,ng < 7 x 107° cm, and 
—25° <6 < +65°, where E(z) = H(z)/Ho is the normalized Hubble parameter. 

As mentioned above, the selection of background galaxies is important for an 
accurate mass reconstruction. The sample of background galaxies basically consists 
of red galaxies satisfying the condition AC = (V — 7’) — (V —i’)gso > 0 for each 
cluster. We made a positive color cut AC’ > 0.475 to eliminate contamination by 
faint red cluster members due to statistical errors and possible intrinsic scatter in 
galaxy color (for details, see Okabe et al.**). This results in just 1% contamination 
of the sample by foreground and cluster members. 

Each individual cluster is detected at a typical peak signal-to-noise ratio 
(S/N) ~ 4 using the 2D Kaiser—Squires mass reconstructions. The signal is stacked 
by the shear catalogs in physical length units centered on the respective bright- 
est cluster galaxies (BCGs) and the average cluster mass distribution for the 
full sample, with a peak S/N of 28. The stacked lensing mass measurement is 


less sensitive to cluster internal structures and 3-dimensional halo orientation. 
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Fig. 13. Stacked tangential shear profile (AN+ ~ Der zz) of 50 clusters in units of projected 
mass density, where different cluster and background redshift galaxies are weighted by the lensing 
kernel. The projected radius is computed from the weighted mean cluster redshift. The solid, 
dashed, dotted and dashed-dotted curves are the best-fit NFW, SIS, gNFW, and Einasto profiles, 
respectively. The lower portion shows the results of the 45° test for systematic errors (Okabe 
et al.34). (For color version, see page II-CP12.) 


Thus it enables us to measure an averaged cluster mass profile for the sample. 
When the signals from 50 clusters are combined, the number density of back- 
ground galaxies becomes 266.3 arcmin *. The B-mode signal is smaller by at least 
an order of magnitude than the E-mode signal. Figure 13 shows the stacked 
shear profile over the radial range 100h~'kpc < r < 2.8h~!Mpac. This shows 
that the profile is well described by the predicted NFW profile with parameters 
Myir = 7.191028 x 10!4h-1Mo and cyir = 5.4179;42 with less than 10% statistical 
error. The concentration parameter obtained from stacked lensing profile is broadly 
in line with theoretical predictions (e.g. Bhattacharya et al.18"). No evidence of 
departures from the NF'W profile is found. 


4.4.2. Dark matter subhalos in the coma cluster 


According to the CDM structure formation scenario, less massive halos are accreted 
into more massive halos, which are then subsequently eroded by effects combined 
with tidal stripping and dynamical friction of the host halo, eventually becoming 
a smooth component. In this process the central regions of subhalos have survived 
under the overdensity field, and constitute their population. Numerical simulations 
of, and analytic approaches to CDM based structure formation scenario predict 
that subhalo mass functions at the intermediate and low mass scales follow a power 
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law, dn/dinMsuy « Mz ¥ with a ~ 0.9-1.0 (e.g. Taylor and Babul!*8"4°; Oguri 
and Lee!!; van den Bosch et al.!42; Diemand et al.!4?; De Lucia et al.'44; Gao 
et al.;!4° Shaw et al.!4°; Angulo et al.'4’; Giocoli et al.;'48 Klypin et al.!4°; Gao 
et al? Wau. et al"), 

Thus the statistical properties of cluster subhalos such as mass function and 
spatial distribution, provide us with useful information about the mass assembly 
history. Their measurement is the most stringent test of CDM predictions on scales 
of less than several Mpc. A characteristic feature of the subhalo mass function is 
also critically important to constrain the nature of dark matter, because it depends 
on the particle mass of dark matter. Furthermore, measurement of the correla- 
tion between member galaxies and subhalo masses sheds important insight on the 
physics of galaxy evolution associated with dark matter. Thus it is important to 
measure the mass function directly from observations without assuming a relation- 
ship between dark matter and luminous matter and the dynamical state of the 
system. 

Weak lensing observations of very nearby clusters (z < 0.05) are ideal for this 
purpose because their large apparent size enables us to easily resolve less massive 
subhalos inside the clusters and provides a correspondingly large number of back- 
ground galaxies, which leads to low statistical errors and compensates for the low 
lensing efficiency to achieve a high S/N. 

We show the results of a 4.1 deg? weak gravitational lensing survey of subhalos in 
the very nearby Coma cluster at redshift z = 0.0236 using the Subaru/Suprime-Cam 
(Okabe et al., 2012). The observed area is about 80% within r299 inside of which the 
mean density is 200 times the critical density at the cluster redshift. Note that one 
arcmin corresponds to 20h~! kpc for the cosmology with Qm,o = 0.27, Qa,o = 0.73 
and Hp = 100 hkms~!'Mpc7!. 

Figure 14 shows the projected mass distribution with a smoothing scale of 
FWHM= 4’. and units of significance of vy = «/o,. As shown in this figure, 32 
subhalos were detected and their masses are measured in a model-independent 
manner, down to the order of 10° of the virial mass of the cluster. All of them 
are associated with a small group of cluster members. The mean distortion profiles 
stacked over subhalos show a sharply truncated feature well-fitted by a Navarro— 
Frenk—White (NFW) mass model with the truncation radius, as expected due to 
tidal destruction by the main cluster. 

It is found that subhalo masses, truncation radii, and mass-to-light ratios 
decrease toward the cluster center. Figure 15 shows the subhalo mass function 
dn/dlnMgup constructed by weak lensing mass measurements. It covers a range of 
two orders of magnitude in mass and is well described by a single power law 


dn 
i _ gee 141 
dinM., ¥’ ee 


. = +0.42 
with a = 1.097 9°39. 
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Fig. 14. Projected mass distribution with a smoothing scale of FWHM= 4’ and units of signif- 
icance of vy = K/o,. The shear is used without taking into account the LSS lensing effect. The 
contours of significance start at lo with a step value of lo (Okabe et al.'®8). (For color version, 
see page II-CP13.) 


This agrees with the predicted value of ~ 0.9-1.0 from an N-body simulation 
based on the CDM scenario. 


4.5. Cosmic shear 


The most challenging observation in weak lensing is cosmic shear, which is weak 
lensing due to the large scale structure of the universe. It attracted much attention 
because its signal depends on the expansion rate of the universe and the growth 
rate of structure. It therefore carries useful information about dark energy and the 
theory of gravity. 

Cosmic shear has already been detected by several groups.?° °° Subsequent 
detections have put useful constraints on the matter density parameter Q,,,9 and 
on the matter density spectrum normalization og (Bacon et al.'°?; Hoekstra, Yee 
and Gladders!?; Bacon et al.!°4; Hamana et al.!°°). More recent results are found, 
for example, in Fu et al.1°° for the CFHTLS third data release; Schrabback et al.!°° 


Fig. 15. 
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Subhalo mass function spanning two orders of magnitude of subhalo masses. The red 


solid and blue dotted lines are the best-fit power-law model and Schechter function, respectively. 
The best-fit powers are in remarkable agreement with CDM predictions. Green dashed lines are 
the mass function for spurious peaks. The thick and thin dashed lines are the best-fit and the 68% 


C.L. uncertainty, respectively (Okabe et al.158). (For color version, see page II-CP13.) 
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for the Hubble Space Telescope COSMOS survey; Benjamin et al.'°” and Kilbinger 
et al.!°® for CFHTLens; and Jee et al.!®° for the Deep Lens Survey. In particular, 
the Canada—France—Hawaii Telescope Lensing Survey (CFHTLenS) covers 154 deg? 
in five optical bands where the shapes of 4.2 million galaxies with 0.2 < z < 1.3 
(photometric redshift) are measured. Combining with WMAP7, BOSS and an HST 
distance-ladder prior on the Hubble parameter, it is found that Qy,,9 = 0.283+0.010 
and og = 0.813 + 0.04 for a flat ACDM model. 

On-going and planned galaxy surveys aiming to observe cosmic shear including 
the Panoramic Survey Telescope & Rapid Response System (Pan-STARRS),° the 
Subaru Hyper Suprime Cam (HSC) Survey,!®! the Dark Energy Survey (DES) ,!©? 
the Large Synoptic Survey Telescope (LSST),'6* the KIlo-Degree Survey (KIDS)164 
and Euclid.!®° However the signal is extremely small and contains many system- 


atic errors such as inaccurate shape measurement, and inaccurate determination of 
redshifts. Also, we do not yet have a complete theoretical understanding of small 
scale clustering. Thus it is essential to reduce these systematic errors down to a 
required level in order to place a useful constraint on the cosmological informa- 
tion (Huterer et al.1®°; Cropper et al.'®’; Massey et al.1®8) 
actively developing research field. Here we give an introduction to understand the 


. This is an ongoing and 


physics of cosmic shear. 

Cosmic shear observations measure the coherent distortion of background galax- 
ies. This is related to the convergence of the matter density field and thus the matter 
power spectrum. In fact, there is a relationship between the shear and convergence: 


(4(k)4(k)) = (&(k)A(k')) = (277)?C(k) dp (k — k’), (142) 


where the quantities with a “hat” are Fourier transforms of the corresponding 
quantities, and C(k) is the lensing power spectrum of the shear (convergence). As 
easily imagined, the power spectrum is related to the matter power spectrum, and 
thus the observation of the cosmic shear provides a method to measure the evolution 
of the matter power spectrum. 

The details are as follows. First, the cosmological lens equation may be written 
as the equation for the convergence field by the matter distribution at the comoving 
distance A(z) as follows: 


30mH2 PP (A—N) 
0 0: NO f 14 
&(0, 2) = | dx alvya d(x’, A), (143) 


where 6 is the matter density contrast and the Limber approximation has been 
used. We have neglected the contribution from integration along the line of sight. 
As seen from the above equation, the dependence of the cosmological parameters 
appears in two parts, in the comoving distance and in the evolution of the density 
contrast. The latter depends not only on the global geometry of the universe but 
also on the local gravitational force, and thus cosmic shear is in principle able to 
distinguish between dark energy and an alternative theory of gravity as the origin 
of the accelerated expansion. 
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The power spectrum for shear (convergence) may be written in terms of the 
matter power spectrum P; as follows: 


9H4A0m [> .. 9?(d) k 
CE = we f d\—~ Ps ( =; x J. 144 

(= ee fanaa (Sex) 
This is the desired relationship between the power spectrum of shear and the matter 
power spectrum. Here g(x) is defined as 


AH ri 
0d) = f° axpQ)~>>. (145) 


with p(x) being the observed probability distribution of galaxies for the survey. A 
precise knowledge of p(x) is only obtained by spectroscopic observation, which is 
very time consuming for a large survey. 

Assuming a close correlation between the dark matter density contrast d(z) 
and the number density of galaxies n,(z), the sum of a huge number of galaxies 
makes the convergence observable. Although large and deep redshift surveys are 
planned in the future, a photometric determination of the redshift (photo-z method) 
is used for existing surveys and those planned in the near future. One of the errors 
in the observation of cosmic shear arises from the error in the photo-z method. 


4.5.1. How to measure the cosmic density field 


In practice, the following two point correlation functions are observed: 


£4 (9) = (14%4)(9) = (1x x), € (8) = (Y=) (9). (146) 


Since y, — yz, nad yx — —7x under the parity transformation, €, should vanish. 


These correlation functions may be expressed in terms of the power spectrum. 
For this purpose, it is convenient to choose the coordinate as 6 = (6,0) and then 
V1 = —%t,72 = —Yx. Thus 


E+ = (1(0)n(0)) = f Fe ro(e0jC10) (147) 
Similarly 
g-= f Fuoco, (148) 


where J;,(x) is the nth Bessel function. However, the power spectrum is expressed 
in terms of these two point functions 


C(l) = 2m | a866..(0) o(€8) = 2m | a60s (0) J4(£0), (149) 
where we use the orthogonality of the Bessel functions 


i dxxaJn (ux) In (vx) = “sp(u —v). (150) 
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In practice, the averaged tangential shear is used. For this purpose, first we 
define the window function U, which satisfies the property 


[ou 4.) =0, (151) 
where 0, characterizes the radius of the averaging. The filtered convergence is then 
defined as follows: 

F(6r,:80) = | @P0"U(|6' ~ Bol; 6,)x(8"), (152) 


where @o is the center of the region averaged over. One can easily confirm that the 
above may be written as the integral of the tangential shear 


F630) = [ 26'Q(|6! ~ 80|:0,)r0(6") (153) 
where 
Q6:0.) = a f " d0!0'U (6';8,) —U(0:0,). (154) 
Consequently, the ensemble average is given by 
(F2) (0:89) = an f aec(e) (/ 10000, 0,).0(08) (155) 


One can obtain information regarding the power spectrum (which depends on 
the cosmological parameters) by observing the dispersion of the aperture mass 
which is constructed by measuring the tangential shear. 

There are several choices for the window function. Here we use one proposed by 
Schneider et al. (2002): 


9 ( ) (; -) 
1 0 < 6; 
U(0) = ¢ 75; Ges oF . (156) 


2, 2 
a2 (1-5) 0< 06, 
Q(0) = ¢ 75; OF 


(157) 
0) 0 > 6,. 


Consequently, the ensemble average is called the aperture mass and is written 
as follows: 


(M2,)(0,) = 2m ; dleC(£)1(£0,.) (158) 
with 
2 
1(00,) = gy te) (159) 


The functional form of the kernel [(¢6,) for the aperture mass shows that it 
plays a role of picking up modes @6,. ~ 5 in the power spectrum, and is considered 
a good statistical quantity to evaluate the power spectrum. 
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Since the power spectrum may be expressed by the correlation functions €4 


oae=5 f Bir. (Zorn (2)e@), coo) 


where 
T(x) = 576 | © jolat) [MOP (161) 
T_(x) = 576 [ Salat) [zateyl (162) 
we can also define 
ie ; 0Q(0)%(0). (163) 


This quantity vanishes identically as long as the convergence is real. Thus (M2?) 
may be used for the evaluation of noise: 


mayo.) =5 f° Sr [te (g)e@-7- (g)ew@]- aoa 


In an ideal situation, (M2) should vanish identically because €, = 0. This does 
not happen in practical observations due to the existence of noise. Actually it may 
be shown by using the E/B decomposition of the convergence that (M2) is the 
B-mode part of the aperture mass. 

However the observed region is finite and the number density of background 
galaxies is finite in practical observations. Thus we cannot take a correlation 
between two galaxies separated more than some O0nax and less than Omin ~ 1/ JNg- 
Thus &- and B-mode decomposition is not perfect and there exists leakage from 
the E-mode to the B-mode on small scales if 8, < 2@max. To avoid such leakage it is 


possible to choose a window function to define a filtered convergence that it can be 
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calculated using only the two point correlation function with a finite separation. 

In practical measurements of the two-point correlation function, we need to con- 
struct an unbiased estimator and a covariance matrix in order to compare observ- 
ables with theory. We ask the reader to consult relevant references such as Schneider 
et al.‘™ and Semboloni et al.'”? for detail. 


5. Conclusion and Future 


At 100 years after its birth, general relativity has become an indispensable tool for 
understanding our universe. Gravitational lensing is not the only example, but is 
certainly one of the most actively investigated examples, which was brought about 
by rapid advances in observational apparatus and techniques since the 1980s. This 
tendency will continue in the future with ongoing and planned weak lensing surveys 
such as the Subaru HSC survey, Pan-STARR, DES, KIDS and Euclid. The role of 
gravitational lensing will become increasingly important in observational cosmology. 
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In this review, we have provided a pedagogical introduction to the basics of 
gravitational lensing and highlighted some of the recent achievements. The research 
area where gravitational lensing plays a role is very wide. In fact, any cosmologi- 
cal observation suffers from the effect of gravitational lensing to some extent. For 
example, the magnitude—redshift (m-z) relationship in Type Ia Supernovae (SNIa) 
173,174 and it is expected that 
future supernovae surveys will determine the nature of dark energy. Since the m—z 
relation is influenced by the lensing by large scale structure, this program is pos- 
sible only with an accurate estimate of the lensing effect. Lensing effects by large 
scale structure in cosmological observables have been regarded as noise from an 


was used to confirm the acceleration of the universe 


observational point of view, but their measurement may turn out to yield impor- 
tant information on structure formation and cosmology if we have an accurate 
theoretical understanding of its origin. 
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Starting with an account of historical developments in Japan and Russia, we review 
inflationary cosmology and its basic predictions in a pedagogical manner. We also intro- 
duce the generalized G-inflation model, in terms of which all the known single-field 
inflation models may be described. This formalism allows us to analyze and compare 
the many inflationary models that have been proposed simultaneously and within a 
common framework. Finally, current observational constraints on inflation are reviewed, 
with particular emphasis on the sensitivity of the inferred constraints to the choice of 
datasets used. 


Keywords: Cosmology; inflation; cosmic microwave background. 


1. Introduction 


The classical Big Bang Cosmology! was the first theory of the universe starting 
from physical principles that succeeded in interpreting in a unified manner the 
three fundamental cosmological observations of the time: The homogeneous cosmic 
expansion, originally discovered by Hubble? and Lemaitre?; the cosmic microwave 
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background radiation (CMB) discovered by Penzias and Wilson*; and the abun- 
dance of light elements.°*® The classical Big Bang cosmology, however, suffered from 
several fundamental difficulties such as the horizon problem, the flatness problem,” 
and the initial singularity problem. All these problems relate to the initial condi- 
tions for the universe. Because reliable theories to describe the very early universe 
were lacking, these problems were not studied seriously until quite long after the 
confirmation of the classical Big Bang cosmology by the discovery of CMB. 

Such a situation was somewhat changed by the introduction of grand unified 
theories (GUTs) by Georgi and Glashow.® Since the unification scale of GUTs, 
Maur = 10!°~!® GeV is on one hand too big to be experimentally accessible but 
on the other hand is just below the Planckian scale, Mp, = (fic®/G)'/? = 1.2 x 
10!° GeV, where quantum gravitational effects become important, there emerged a 
trend to use the high temperature and density regime in the early universe as an 
arena for its verification, stimulating serious study of the birth of the universe. 

At the same time, however, GUT theories challenged the classical Big Bang 


cosmology by predicting a phase transition? which overproduces!” 


magnetic 
monopoles!!:!? that would dominate the matter content of the universe, leaving 
no room for ordinary baryonic matter, with Qmonopole/Qo 2 10!°. Fortunately, 
however, the path toward resolving this serious problem was already present in 
the GUT itself, as observed independently by Sato!? 15 and Guth.'® Inflationary 
cosmology thus came into being. 

These pioneers observed that in the symmetric state of the Higgs field, which is 
realized thermally shortly after the Big Bang, the field has a large potential energy 
density of order of Méy-p, and that if this state is metastable with a long enough 
lifetime, the universe would be dominated by the false vacuum energy density which 
would not decrease as long as the Higgs field remains in that state no matter how 
much the universe expands. As a result, an exponential cosmic expansion is realized 
which was referred to as cosmic inflation. Since both the particle horizon and the 
curvature radius are exponentially stretched during inflation, it can in principle 
provide solutions to the horizon and the flatness problems. 

Apart from the GUT-based approach mentioned above, there was a longer tra- 
dition in the Russian school to pursue theories of the very early universe. Although 
many were speculative, a number of important ideas were proposed. An important 
such idea was Starobinsky’s curvature square theory which realized exponential cos- 
mic expansion followed by Friedmann regime.!” Thus nowadays these three authors 
are regarded as the pioneers of inflationary cosmology?® before the slow-roll scenario 
was proposed.!%:20 

Before starting to review standard inflation updating Ref. 21, we first describe 
early history of inflation. Since much has already been written about the devel- 
opments in America (see e.g. Refs. 22 and 23), we emphasize those in Japan and 
Russia, which followed remarkably different paths and have not been discussed in 
much detail in the English language literature. 

We take units c=h=1. 
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1.1. Developments in Japan 


In Japan theories of the early universe were developed in close connection with uni- 
fied theories of elementary interactions. Compact astrophysical objects were also 
used as their testing ground. Unification of elementary interactions in nature has 
been a long-standing issue of theoretical physics which is also called the Einstein’s 
dream. As the first step, the theory to unify the weak interaction with the elec- 
tromagnetic force was proposed by Weinberg? and Salam?° in 1967, but it was 
not immediately widely accepted in the community. Only after 1973 when the 
Gargamelle experiment at CERN discovered the neutral current did this theory 
become popular. The importance of the neutral current interaction in the neutrino 
diffusion in supernovae explosions was shown in Ref. 26, which was confirmed by the 
Nobel-prize winning discovery of the neutrino bursts from SN1987A.?" The mass of 
the Higgs boson in the electroweak theory was constrained by CMB?® and stellar 
evolution.?? Symmetry restoration and further breakdown in high density matter 
was also pointed out in Refs. 30 and 31 in the electroweak theory. 

Further progress toward a unified theory was made by Georgi and Glashow, 
who proposed the grand unified theories (GUTs).8 They argued that the three 
fundamental interactions unify at an extremely high energy scale around Maur = 
10!5-'6 GeV. At this high energy scale baryon nonconserving interactions character- 
istic of GUTs are expected to have occurred without suppression. Yoshimura*®? pro- 
posed a model of baryogenesis in the early universe making use of these interactions. 

Applying studies of high-temperature behavior of unified gauge theories** °° to 
the hot big bang universe, Sato and Sato proposed that all the four elementary 
interactions were unified above the Planckian scale when gravity bifurcated, and 
at the GUT phase transition, electroweak and strong interactions were separated 
from each other, followed by the Weinberg—Salam phase transition. Then the cosmic 
history of the elementary interactions can be expressed by a tree diagram as shown 
in Fig. 1. 

Sato also realized that, if the symmetry-breaking phase transition of the 
GUT Higgs field in the early universe is first-order, the universe would expand 
exponentially in a supercooled stage when the false vacuum energy dominates the 
universe, and that a number of interesting cosmological consequences would be 
obtained. 

In the first paper of the series, Sato showed that the particle horizon would be 
exponentially stretched, so that even if CP symmetry is softly broken in baryo- 
genesis, one may realize a large enough domain with positive baryon number 
instead of a mixture of small domains with positive and negative baryon num- 
bers.'4 He then showed that fluctuations associated with the phase transition 
are exponentially stretched and may account for the origin of seed fluctuations 
for large-scale structures.!? Although the current paradigm of cosmological struc- 
ture formation attributes the origin of seed fluctuations to quantum fluctuations 
of a nearly massless scalar field which induces inflation, the essential features of 
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Fig. 1. Evolutionism of interactions. The original picture was first published in an article by 
Humitaka Sato and Katsuhiko Sato in June 1976 in a Japanese science magazine “Shizen” (mean- 
ing Nature) published by Chuo-Koron-sha. 


the generation of superhorizon fluctuations and its reentry to the Hubble radius 
in the subsequent Friedmann stage were already there.!? Finally, the monopole 
problem was also shown to be solved as the exponential expansion can push it 
away from the observable domain.!° Monopoles are diluted by subsequent entropy 
production. 

Unfortunately, however, this type of scenario making use of a first-order phase 
transition, now referred to as old inflation, was not successful because inflation could 
not be terminated to realize a hot Friedmann universe over the comoving scale large 
enough to correspond to the current Hubble horizon. On the other hand, Sato and 
collaborators seriously considered geometrical outcomes after the phase transition 
and reached the revolutionary view that our universe is not unique but there can be 
multiple universes that are causally disconnected from each other.3”3° Nowadays 
the multiverse is very popular in unified theories such as string theories and even 
the probability that a universe like ours is created is seriously considered. One 
should remember that these papers were first to discuss multiverse as an outcome 
of modern unified theories. 


1.2. Developments in Russia 


In former Soviet Union, the possibility of exponential cosmic expansion was first 
suggested by Gliner®® in 1965, who classified the possible algebraic structures of 
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the energy-momentum tensor. Among them he found a form of matter called the 
pi-vacuum, which has an equation of state (EOS) P = —y with the energy density 
p = p. This state has the same invariance as the vacuum and with « > O the 
metric takes the de Sitter form. This proposal, however, cannot be considered as the 
birth of inflationary cosmology since a discussion of the transition to the Friedmann 
regime is absent. In fact, the observation that the matter with EOS P = —p induces 
exponential expansion was already recognized by Lemaitre.*° 

This idea was pushed forward by Gurevich*! to solve the problem of causal 
connection of galaxies. Although he made some thermodynamic arguments of a 
“A” field (he used the character A instead of i), its transition to normal matter 
remained unclarified. Almost at the same time Gliner and Dymnikova*? considered 
a cosmology with EOS P = —p only in the very early universe to avoid the initial 
singularity. In this scenario the EOS was supposed to change to that of normal 
radiation or matter in the course of expansion, but again its physical mechanism 
was left unexplored. 

A paper that was very interesting from modern point of view but not recognized 
as such at that the time was the paper*® by Starobinsky in 1978 on a nonsingu- 
lar isotropic cosmological model whose material content is a massive scalar field. 
Although his objective was to find a viable bouncing solution to avoid the initial 
singularity, all the currently well-known field equations and their slow-roll solu- 
tions that are used in chaotic inflation** had already been given there. As noted by 
Starobinsky himself in his talk at the Tomalla Prize ceremony in 2009, the decisive 
step for Linde to propose chaotic inflation was to throw away the contraction phase 
(and replace it with large quantum fluctuations in the Planckian regime to establish 
natural initial conditions). 

Thus in 1970s much work was done in Russia about the birth and the very early 
stage of the universe, but many of the papers were speculative and not based on 
concrete models, and moreover did not make contact with observation. One should 
note that gems of truly novel ideas often emerge from such speculation. 

One such novel idea is the calculation of the primordial tensor perturbations in 
the early de Sitter stage which was first elaborated by Starobinsky*° in 1979, even 
before the concept of inflationary cosmology was introduced. His original motivation 
was physically sound with the desire to probe the initial state of the universe. He 
had quantum creation of the universe in mind and postulated that the initial state 
of the universe should be highly symmetric. Therefore, he considered de Sitter 
spacetime at the outset. He then considered the quantum mechanical behavior of 
gravitons to probe the initial state of the universe before the big bang, when other 
particles with stronger interactions were likely thermalized thus losing the memory 
of the initial state. 

In this paper again, however, the physical discussion on the transition to the 
Friedmann regime was absent. The sound physical mechanism to realize de Sitter 
expansion as well as subsequent transition to the Friedmann regime was finally 
given by Starobinsky by incorporating higher-order curvature terms in the action 
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based on quantum corrections.'? Mukhanov and Chibisov*® calculated quantum 
fluctuations generated in this model and pointed out the possibility that they may 
account for the origin of seed fluctuations for galaxy formation. 

Meanwhile, apart from these mostly geometrical considerations, Kirzhnits and 
Linde**:** investigated the high temperature behavior of unified gauge theories 
and phase transitions which may have occurred in the hot or dense early universe. 
Their study formed a basis of the inflationary cosmology. Indeed soon after the old 
inflation model was proposed and its problems were elucidated, Linde!® proposed 
the new inflation model based on a Coleman—Weinberg type potential for the GUT 
Higgs field, in which inflation occurs as the scalar field induces a slow-roll over 
phase transition toward its zero-temperature minimum after thermal correction 
of the potential has disappeared due to cosmic expansion.!9?° This was the first 
slow-roll inflation model and the first small-field inflation model using a concave 
potential which is sometimes called hill-top inflation.47 Unfortunately, however, 
the original new inflation predicted a too short epoch of inflation with too large 
density fluctuations,*® and thus could not reproduce our universe. Furthermore, as 
realized by Linde himself, the universe would have been far from a thermal state 
at the GUT era, hence one cannot expect thermal symmetry restoration to set the 
appropriate initial condition for new inflation. These consideration led to chaotic 


inflation.*4 


1.3. Inflation paradigm 


Now the standard paradigm of inflation is that the accelerated expansion is realized 
when a scalar field, dubbed as the inflaton, slowly rolls down its potential toward a 
global minimum in a time scale longer than the Hubble time.!®:2°44 The remarkable 
feature of the slow-roll inflation is that it cannot only explain the global properties of 
the observed universe, but also provide seeds of density and curvature fluctuations 
that evolve into large-scale structures.4645 °° The temperature (and polarization) 
anisotropies generated on CMB at the same time provide indirect cosmological tests 
of the inflation paradigm (see e.g. Refs. 51-53 among others). 

In fact, the slow-roll inflation driven by a potential is not the only possibility. 
There are presently two alternatives. One class of models realizes inflation with- 
out introducing any inflaton field but modifies gravity from the Einstein’s general 
relativity. As mentioned in Sec. 1.2 Starobinsky was the first to show that quasi- 
exponential expansion and subsequent transition to power-law expansion could be 
realized by incorporating higher-order curvature terms in the action based on quan- 
tum corrections.!” Nowadays its simpler version including square scalar curvature 
term besides the Einstein action is referred to as the Starobinsky model. The other 
alternative is comprised of models which make use of a scalar field with a higher- 
order kinetic function. If we can realize a state with a constant canonical kinetic 
function its energy density can have the same equation of state as the cosmological 
constant to drive exponential inflation. Such models are referred to as k-inflation.** 
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All these models can be regarded as a subclass of the generalized G-inflation 
model,®*5° which is the most general theory of single-field inflation with its field 
equations given by second-order differential equations as with most of the other 
theories of fundamental physics. 


2. Resolution of Fundamental Problems 


Although the original inflation models of Sato and Guth realized exactly exponential 
cosmic expansion, it is not a necessary condition to resolve horizon and flatness 
problems. All we need is to realize a sufficiently long period of accelerated expansion, 
when the particle horizon increases more rapidly than ¢ and the Hubble horizon, 
so the horizon problem can be solved. From the Einstein equations for the cosmic 
scale factor a(t) 


‘ 2 
a K 8G 
(=) Aa, (1) 
a 4nG 
a —~g (0+ 3P), (2) 


we see that such an expansion law is possible if cosmic energy density p and pres- 
sure P satisfy p + 3P < 0. Then the energy density decreases less rapidly than 
a~*(t) so that the curvature term in the Friedmann equation (1) K/a?(t) becomes 
unimportant. As a result, the flatness problem is also solved simultaneously. 

Of course, such a stage dominated by a new energy component should not last 
forever but the universe must be converted to a state dominated by radiation to 
realize the initial state of the hot Big Bang cosmology, since we do not wish to 
demolish its success. The accelerated or inflationary expansion must be followed by 
creation of radiation and entropy. In fact it is this epoch rather than the inflationary 
expansion itself that the monopole and other unwanted relic problems such as 
domain walls, gravitinos, etc. are solved. Since inflation exponentially dilutes both 
monopole density and entropy density in the same way, the current universe with 
a huge entropy with a negligible amount of monopoles would never be realized 
without entropy production at the reheating. 

Let us consider quantitatively how much inflationary expansion is required to 
solve the horizon and flatness problems in terms of entropy consideration. First the 
entropy contained in the current Hubble radius Hy 1 _ 4.2 x 10° Mpc is given by 
that of the CMB photon and neutrino background with the effective temperature 
T, = 1.95K as So = 2.6 x 108°. We consider the condition that the comoving Hubble 
volume at the beginning of inflation, 4" ~3 at t = t;, contains more entropy than 
So after the reheating, so that currently observable region is well inside the initial 
Hubble radius at the onset of inflation. 

For simplicity, suppose that the energy density takes a constant value pins during 
inflation and it continues from t = t; to ty to stretch the scale factor by af/a; = e%, 
and that after inflation the universe is dominated by a component with equation- 
of-state P = wp until the universe becomes radiation dominant with the reheating 
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temperature Tz. At this time the initial Hubble radius has been stretched to 


1 
29,74 \ 308) 
Ho! N[{7™ GxtR = ' 3 
: Ge a es 
The corresponding volume contains the entropy 
5 = 409 73, AT a _ 162 (459%! Ta ( H\ it (Ty \ 
— yA = — ay 
og. a 270 473 Mp\ Mp ’ 


(4) 
where g,. is number of relativistic degrees of freedom at reheating. Requiring this 


to be larger than 5p, we find 


12.5 — 10.8w w in ( Gx + 1+3w n( “_ 
l+w 3+3w \106.75/ ' 6(1+w)  \0.01 


1—3w Tr H . 
t l = Nmin, = 0.01 ? 
34+3u (= es] Coa & x 1013 aa) (5) 


where r is the tensor-to-scalar ratio to be defined later. 
As mentioned above, this is the condition that the initial Hubble radius at the 
onset of inflation is stretched to be larger than the current Hubble radius. However, 


N > 67.7 


we must note that the observed density perturbation on this scale is observed to be 
10-°. We expect that the initial amplitude of fluctuation on this scale can be close 
to unity just to initiate inflation. Thus it is not sufficient to have N = Nin but 
we need more inflation by a factor of (10-5)-2 ~ 500 to suppress the amplitude 
of fluctuations to an acceptable level.°” Thus the minimal condition to solve the 
horizon problem is actually N > Nin + In 500 = Nin + 6.2. 


Taking this extra number of e-folds into account we find 


1 i 1 TR 
N>55+2n(“_) +21 : 6 
6° \0.01/ "3 (rear) fo 
for w = 0, and 
1 Gx 1 i I TR 
N>67—-1 ( ) a ( ) 1 7 
6 \106.75/ "3 \0.01/ 3" \ 108 GeV (7) 
for w= 1. 
Next let us consider the evolution of the total density parameter 04.4 using 
K 
‘ae = H? (Otot = 1). (8) 
We find 
Qrot(to) = 1 (coz -2(N—Nenin) -2 6 
-_ =e min) < 500°7 = 4x 107°. 9 
OQror(t;) — 1 ao Ho (9) 


Thus, if the horizon problem is solved by inflation, the flatness problem is automat- 
ically solved and such inflation models predict the total density parameter today is 
equal to unity with four to five digit accuracy. 
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3. Realization of Inflation 
3.1. Three mechanisms 


As mentioned in the introduction one can classify realization mechanisms of infla- 
tion into three broad categories. 

The standard one is the slow-roll inflation models. Consider a scalar field @, the 
inflaton, with a canonical kinetic term X = os g'”0,~0,@ and a potential V[¢]. 
Taking the variation of the action 


s= f cy=gate = f(x - Viel) V=aate (10) 


with respect to the metric tensor, one finds the energy-momentum tensor 


2 6S 
Tw = er = OnG0V0 + LOyw- (11) 
For homogeneous field configuration, the energy density, p, and the pressure, P, are 
given by 


p=584+Vid, P=56-Vidl, (12) 


respectively. Hence if the potential is so flat that V[¢] > ¢? is satisfied for a long 
enough period, the equation-of-state parameter w = P/p is smaller than —1/3 and 
an accelerated expansion can be realized. 

Next, in order to see how a scalar field with a noncanonical kinetic term may 
affect the dynamics of the universe, let us rewrite the Lagrangian in more general 
form as £ = K(X,¢). Then calculating the energy-momentum tensor in the same 
way as (11), we find 


p=2XKx—-K, P=K (13) 


in this case. Thus an accelerated expansion is possible even without any potential, if 
XKx < —K is satisfied. In particular, in the case Kx = 0K/O0X = 0 holds we find 
de Sitter expansion. This is what is called k-inflation. If we expand the Lagrangian 
as a power series of X like 


K(X, 0) = Ki(d)X + K2(4)X?... (14) 


we immediately find that A (¢) and K2(¢) should have an opposite sign in order 
to realize inflation. 

So far we have implicitly assumed the gravity sector is expressed by the Einstein— 
Hilbert action 


1 
Sp= 55 [Rvs (15) 
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and considered a single scalar field as a matter ingredient. We may realize inflation 
even without any scalar field matter by modifying gravity as 


5, = 5a | BW=9e"e, (16) 


where f(A) is a function of scalar curvature R. The trace of field equation is given 
by 


30f(R) + f'(R)R — 2f(R) = 0. (17) 
One can therefore find a de Sitter solution, R = 12H? =const., if f(R) satisfies 
f'(R) = 2f(R), namely, f(R) « R?. A pure R? model cannot terminate inflation 
nor has the proper Einstein limit, so we adopt 
R?2 

This theory is a nonperturbative extension of the Einstein gravity, which contains 
additional scalar degree of freedom called scalaron. 

These mechanisms may not work in a mutually isolated manner but the actual 


inflation may have been induced by their combinations®® with possible higher-order 


derivative terms.°° 


3.2. Inflation scenario 


Let us describe cosmic evolution using a slow-roll inflation model as a prototype. 
Once inflationary expansion sets in, the universe rapidly becomes homogeneous, 
isotropic, and spatially flat, and the energy density other than the inflaton is soon 
diluted away. Hence we may consider a spatially flat FLRW universe from the begin- 
ning except when we discuss the feasibility of inflation in a generic inhomogeneous 
and anisotropic spacetime. Thus the scalar field equation and the Einstein equation 


read 
+ 3Hd + Vl] =0, (19) 
ay" 87s _ Pe 1 
at ee ee _ Pb _1lio 
({) 3Mz, 3M °° 5% + V9), (20) 


respectively, where Mg = Mp,/V/87 = 1/V8nG = (hc®/87G)? = 2.4 x 1018 GeV is 
the reduced Planck mass. 

To realize successful inflation the potential V[¢] must remain the dominant 
contribution to pg for a sufficiently long time, that is, the scalar field must remain 
practically constant over the cosmic expansion time scale. For this purpose d must 
contribute negligibly in the equation of motion. Then field equations read 


3H¢é+V'[¢] =0, (21) 


a)? _ 8nVIdl _ Vid) 
a 3M2, 3M3’ 
which are called the slow-roll equations of motion. 


(22) 
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For these approximate equations to hold, the potential must satisfy 


_ Ma (V'ldl\* = y2v_l4l 
=5 (Fe) <1, mw =Mée id)’ Inv| <1. (23) 


Here ey and ny are called the (potential) slow-roll parameters. Conversely, if these 
slow-roll conditions (23) are satisfied, inflationary expansion is likely to start as soon 
as the potential energy dominates the cosmic energy density. Inflation or accelerated 
expansion is terminated when ¢? becomes larger than V[@] or when |H| becomes 
larger than H?. Under the slow-roll approximation, (21) and (22), the former occurs 
at ey = 3/2 and the latter at ey = 1. The discrepancy is due to the invalidity of 

the slow-roll approximation in such a regime. 
Under the slow-roll approximation, the number of e-folds of inflation after ¢ has 

N= | Hdt= — = 

; ON Q Me 


crossed a value @y is given by 
f° do 
on Jey 
with oy being the value at the end of inflation. 

Figure 2 shows typical shape of the inflation potential, which has two possible 
field ranges to realize inflation. Near the local maximum at ¢ = 0 the potential 
is so flat that inflation may be possible. Models of such class are called small-field 
models. On the other hand, if the potential increases at most with a power-law, 
one can see that the slow-roll conditions (23) are satisfied for super-Planckian field 
values, so that inflation is also realized there. Such a model is called a large-field 
model. 

In both cases the slow-roll conditions no longer holds once the field approaches 


the global minimum at ¢ = v, and it starts to oscillate around it. Thus the potential 
energy density is transferred to field oscillation energy, which will eventually decay 


of do 1 


(24) 


V[o] 


Large field model 


Small field model 


Fig. 2. Scalar field potential for single-field slow-roll inflation. 
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into particles interacting with ¢. The universe will then be heated up to realize the 
initial condition of the big bang universe physically in the modern context. 

In fact, Starobinsky’s R? inflation can be regarded as a variant of a large- 
field model because the modified gravity action can be converted to the Einstein 
gravity plus a scalar field with a potential. Specifically, in terms of conformal 
transformation, 


? R 
Iuv = (1 =F sa) Juv (25) 


we find the action is equivalent with 


l 5 1 
Sy = OK [vier [vets (59""0.08,0- vial) : (26) 


Kb = em (1 in) , Vid] = oM3MP(1 — e VERO)? (27) 


Thus the system is equivalent with a scalar field matter in the Einstein gravity. 
For ¢ > Mg = «~"', the potential has a plateau with a height 3M2,.M?/4 where 
inflation can occur. Inflation is followed by scalar field oscillation around the origin 
and reheating proceeds through gravitational decay of the inflaton @. 

Finally, k-inflation of the type (14) is terminated when both Ky(¢) and K2(¢) 
become positive. Then X starts to decrease rapidly and only the first term becomes 
relevant. Now the universe is dominated by the kinetic energy of a free scalar 
field whose energy density dissipates in proportion to a~°(t) with the equation- 
of-state parameter w = 1. This abrupt change from de Sitter to a power-law 
a(t) x t!/3 induces gravitational particle production to reheat the universe in this 
model. 


4. Slow-Roll Inflation Models 


Let us focus on specific models of slow-roll inflation with a canonical kinetic term. 


4.1. Large-field models 


This model was originally proposed by Linde under the name “chaotic inflation” ,44 


since it makes use of the chaotic initial condition of the universe at the Planck time 
tp) = (hG/c®)1/? = 5.4 x 10~*4 sec, when we expect large quantum fluctuations to 
dominate. Let us consider the following simple Lagrangian as an example assuming 
that Einstein gravity starts to hold at that epoch. 


Ly = -5(08)* Vid), Vd] = 5mPe (28) 


By virtue of the uncertainty principle, at the Planck time we expect that both 
gradient energy and potential energy densities were fluctuating with the Planckian 
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magnitude, so that 
1 1 
-5(06)" = Mp, sme - Mp, (29) 


in each coherent domain. Taking m < Mp, we find ¢ can take a large value up 
to 6 ~ M3,/m > Mp). If the magnitude of ¢ is saturated, from the gradient 
energy constraint we find the coherent length of the field, L, satisfies $(6/ Ly? ~ 
M$,/(Lm)? < Mg, or L = m7! > Mj. That is, ¢ is homogeneous over the 
Compton wavelength well above the horizon scale at the Planckian regime. 

Then over the horizon scale one may regard the field as homogeneous and apply 
the homogeneous field equations derived in the previous section. Solving the field 
equation (19) and the Einstein equation (20), one finds 


(t — ti), (30) 


= af exp ; — 20 M2, 

Inflationary expansion is terminated around ¢ < Mp)/ 4x when ey becomes 

unity and |d/@| becomes as large as H. Therefore, in order to solve the horizon 
problem one needs an initial amplitude ¢; 2 3Mp). 

The most stringent constraint on this model comes from the amplitude of density 

and curvature perturbation generated during inflation, which sets the mass m = 

10'3 GeV and constrains the self-coupling of the form \¢4/4 to A < 107!” (see 


Sec. 13.1).°9 


4.2. Small-field model 


As mentioned in Sec. 1.2, the first small-field model is Linde’s new inflation model!® 
which actually has a problem in realizing the required initial condition. There is 
another class of small-field inflation models called topological inflation®”*! 
this problem is solved for topological reason without resorting to thermal symmetry 
restoration. Consider a simple Lagrangian 


in which 


1 mn 
£=-3(06)?— Vid], where V[d] = 3(¢? — 0)”, (32) 
which has a local maximum at ¢ = 0. Since ¢ is a real scalar field, this model 
admits a domain wall solution connecting two vacuum states 6 = +v. Neglecting 


gravitational effect for the moment, we find a solution 


o(a) = vtanh (3) (33) 
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that describes a domain wall on the xy-plane. Its thickness do can be estimated by 
= 

equating (Vd)? ~ (v/do)? and V[0] = Ve as dy © vVe ?. On the other hand, the 

Hubble horizon scale corresponding to the vacuum energy density V. is given by 


8nG_.\~? 3 \? 
H-* = (=—y, = M 34 
st = (SE) atm (gE)" 00 


which is smaller than the thickness of the wall if v = Mp). In this case one can find 
a region with a large potential energy density V ~ V. whose dimension is larger 
than the Hubble scale near the center of the domain wall. Such a field configuration 
provides a sufficient condition to initiate inflation there without being affected by 
the configuration outside the Hubble horizon. 

In this model, an initially random field configuration in the global spacetime 
naturally creates domain walls inside of which inflation sets in near the central 
core. In this sense this model provides a natural mechanism of small-field inflation 
provided that the universe continues to expand until the energy density of the 
domain wall is locally dominant. We can also show that inflation ends after a finite 
time except for the locus with ¢ = 0, and the reheating process proceeds just as in 
the chaotic inflation model. 


4.3. Hybrid inflation 


This is a model to induce inflation by a false vacuum energy density through a 
nonthermal symmetry restoration by virtue of an extra scalar field.6? The simplest 
model of hybrid inflation consists of a real scalar field (¢) and a complex scalar 
field (y) with a Lagrangian 


£ = ~(0x)'(x) — 5(06)? — Vix 4, 


where 
Vind) = SUlxl? — 02)? +o? bbl? + 529 (35) 
From 
it = X(IxP? —0?)x + 9°", 
av we 
Fat = ME = AQP - 04) + 070", 


we find that y = 0 is a potential minimum if the inequality g?¢? > Av? is satisfied 
initially. Then the potential reads 


V[x = 0, d] = Ay! + sme. (37) 


If the first term dominates the energy density of the universe, inflation takes place. 


It is terminated at @ < ae when y induces a phase transition, which occurs within 
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one Hubble time typically unless fine-tuning of parameters is applied. This model 
is attractive from particle physics viewpoint because enough inflation is possible 
even if @ is much smaller than the Planck scale®? and neither \ nor g requires 
fine-tuning. Instead the initial fields configuration of the two scalar fields do need 
tuning to realize inflation.6+® 

In this model, since y is a complex scalar field, a network of strings will be formed 
after the phase transition. Where y a real scalar field, a network of domain walls 
would be formed and dominate the energy density. Other mechanisms of formation 
for the topological defects have been proposed in Refs. 66-70 near or at the end 
of inflation. Hence there may well remain some relic defects even if their energy 
scale is higher than the maximum temperature after inflation so that symmetry 
restoration by thermal effects is impossible. 


5. Reheating 


The entropy production process required after inflation to realize the initial condi- 
tion of hot Big Bang cosmology is called reheating, although in modern inflationary 
cosmology the universe had never previous been dominated by radiation. 

As is clear from the discussion in the previous section, the universe is dominated 
by coherent scalar field oscillation after potential-driven slow-roll inflation. Let us 
take the large-field massive scalar model as an example. As the inflaton starts to 
satisfy |¢| < Mpi/V6z the scalar field starts to oscillate around the origin with 
a period 27/m. Such field oscillations are equivalent to the condensation of the 
homogeneous zero-mode of the scalar field which decreases its amplitude through 
cosmic expansion, to decay into radiation finally. In the initial oscillatory regime 
when its amplitude is large, some nonperturbative particle production also takes 
place efficiently to some extent. This is called preheating.’! But the final reheat- 
ing stage is always dominated by the perturbative decay of the scalar field, which 
is accounted for by incorporating a decay term ['g¢ in its equation of motion.747 
Here Ig is the decay rate of a @ particle which is much smaller than m since the 
inflation must be weakly coupled to other fields to suppress quantum fluctuations 
to an acceptable level as discussed in the following sections. 

Multiplying ¢ to (19) after introducing the above mentioned dissipation term, 
we find 

d 


a (5% + sme?) = —(3H +T4)¢’. (38) 


Since ¢ is rapidly oscillating over the cosmic expansion time scale, we can replace 
é? in the right-hand side by an average over the oscillation period, é , which is 
identical to the total energy density of @, pg, thanks to the virial theorem.’’ Thus, 
we find a Boltzmann equation 


dpg 


ai (3H +T9)p¢, (39) 
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which is associated with that for radiation energy density p, 


Pe = —4Hp, + T epg. (40) 
These two equations can be solved as 
-3 
pelt) = pelts) [0] exol-Tott th (a1) 
tha ae 
ett) =F f [SO] polar. (4) 


Thus the universe starts to be dominated by radiation at t ~ te when the radi- 
ation temperature Tp is given by 


1 ad 
87 2 8x 17 gy 2 1 1 

H=(=7p,) = Tr) =a ==°s¢, 43 
(suze ) (siz 30 ’) a oy 

i 1 + 

200\ 4 200 \ 4 r 2 
Tr ~0.1 ./Mpils & 10" $ GeV. 44 
x (2 ) nee (= ) (sordev 4) 


Such is the case inflation is followed by scalar field oscillation. If inflation is 
induced purely by kinetic terms as in the case of k-inflation®* and G-inflation,®° the 
entropy after inflation is attributed to gravitational particle production”® associated 
with the change of the cosmic expansion law. Here we describe this process following 
Ref. 79. 

Let us consider the case where the cosmic expansion law changes from de Sitter 
with the Hubble parameter Hin¢ to a power law with the index t'/? corresponding 
to a universe dominated by the kinetic energy of a free massless homogeneous scalar 
field which is called the kination regime. 

From the standard calculation of particle creation, 
ation of massless minimally coupled scalar particles. The relevant Bogolubov coef- 
ficient 6,, is given by 


78,80,81 we consider the cre- 


== e- 2 1V (n)dn, (45) 


= Be = 


with V(m) = 4a?()R(n), which yields expressions for the number density and 


energy density 


= 1 aa ae 
m= sage | WBeluPde (16) 
fie [ (Bo |2w2dw. (47) 
Qn2a4 0 


If we approximate the transition from de Sitter to the power law expansion occurred 
abruptly we find the resultant radiation energy density diverges. We must adopt 
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a model where the scale factor evolves smoothly up to the second time derivative. 
We consider the following model where the scale factor a?(7) = f(Hinen) evolves as 


= (a < —1), 
f(z) = do +412 +agt?+a3z° (-l1<2<29-1), ee) 
bo(a + b1) (to -1< 2) 
requiring f(x), f’(), f’ (a) to be continuous at « = —1 and x = xp — 1, so that the 


scalar curvature is continuous. Since the transition occurs within the Hubble time 
xo is smaller than unity. The coefficients are determined as (up to O((a9)°)), 


1 3 3 
ao =6-—, a, = 8- —, ag =3-——, (49) 
Xo Xo XO 
1 3 
=-—, b=2, b=. 
a3 af Oe BG (50) 
Then the energy density of created particles is given by 
Aine 
ee (51) 
where 
fe - | die i fig In len — ey" CHV), (52) 
1 
PERSP 
V@) = —_——_* (53) 


f2 
The upper limit of integration x = H7 is the time when V falls well below unity 


so that the notion of the particle is well-defined. We find J ~ —361n 29, so that the 
energy density in the beginning of the kination regime is given by 


HA, i 
pease 54 
Pr 32n2a4 "(aam) on) 


where At is the time elapsed during the transition from de Sitter to kination. We 
will ignore the logarithmic dependence on At and take In(1/(HAt)) ~ 1. If there 
are N modes of species created this way, (54) should be multiplied by N. 

Since the energy density of the scalar field in the kination regime behaves as 


ps = 3M2H2 a, (55) 


the scale factor when the universe becomes radiation dominant is given by 


a (56) 
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so the energy density of the relativistic particles is 
81N°H8 


inf (57) 


Pel = Seaceae 
(3202)3 M4 


Thus the reheating temperature is given in terms of the tensor-to-scalar ratio r by 


1 
BNa (30. \* He “Fi 
Te= int 10°N4 Vv. 
R= Gomi (=) oe (spars) (aor) SV: 8) 


6. Generation of Quantum Fluctuations that Eventually 
Behave Classically 


Thanks to the smallness of the slow-roll parameters, quantum field theoretic prop- 
erties of the inflaton during inflation is quite similar to that of a massless minimally 
coupled scalar field y(a,t) in de Sitter space 


ds? = —dt? + e?#tdg?. (59) 


Such a scalar field is known to exhibit anomalous growth of the square vacuum 


expectation value according to®?-*4 


(ote) = (2) me (60 


Decomposing the scalar field as 


3 
= ia b,(te® =, (61) 


we find the momentum conjugate to y(a,t) reads m(a,t) = a°(t)p(a,t), hence 

the eomuiaers commutation relation [y(a, t), 7(a’,t)] = id(a@ — a’) is equivalent to 

[a,, @!,] = 5) (k — k’) if we impose the normalization condition 
a 

a(t)” 


Pr(t)Px(t) — Pe(t)yx(t) = (62) 
Thus @) and G, satisfy the usual commutation relations and act as creation and 
annihilation operators of a k-mode, respectively. 

The mode function y,(t) satisfies 
a d i 


qa t 34a + cam | veld) = 0, (63) 


as derived from the Klein—Gordon equation in de Sitter space, which is solved as 


| 
H(- AL kn) = (1 + ikn)e7*", 64 
(= [Fun i) = ag + i (64) 
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under the condition (62). Here 7 is the conformal time defined by 


* de * dt 1 
n= gf aa Hem a 


Note that since (63) is a second-order differential equation there are two inde- 
pendent solutions proportional to H Q) (—kn) and H e} (—kn). We have adopted only 
(64) which coincides with the positive-frequency mode in the Minkowski space at 
large wavenumber limit where the effect of cosmic expansion is negligible. 

Note also that —kn = IO} is nothing but the ratio of physical wavenumber to 
the Hubble parameter. In the superhorizon limit k < a(t)H, we find 


iH ; iH ik\ , 
= —=_(1+ ikn)e~#*" = — (1- =) e!Ha 
Pk aie n) aaa Ha 


iis '*°((a)) . 


so that we can put y*(t) = —y,(t) and regard @,(t) = yr (t)(G. — a!) holds when 
the wavelength corresponding to k is much larger than the Hubble radius. Then the 
momentum conjugate to ¢, is given by #,(t) = a(t)? ox (t)(G@, — @',,), which means 
that both ¢%, and 7, have the same operator dependence in the superhorizon regime. 

Thus in this regime, where the decaying mode is negligible, the quantum oper- 
ators ~, and 7, commute with each other, so that quantum fluctuations behave as 
classical statistical fluctuations.*° Furthermore, the absolute square of mode func- 


tion behaves as 


H H 
|yx(t)|? = ae [1 + (kn)?] aaa when =+i0), (67) 


that is, it takes a constant value proportional to k~*. Multiplying the phase space 

density over a unit logarithmic frequency interval, dak, dk with dk = kdlnk = k, 

we find the dispersion takes a constant over each logarithmic frequency interval. 
One can immediately see that (60) can be reproduced from this mode function 


by summing up only superhorizon modes only as 


Hellt 3 2 
(er) [wor gar = (=) we (68) 


where the infrared cutoff is identified with the mode that left the Hubble radius at 
the beginning of inflation set to t = 0 here.®° 

The behavior that the square expectation value increases in proportion to time 
is the same as that of Brownian motion with a step +H/(27) at each time interval 
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H-'. Thus one can describe the quantum fluctuations generated during inflation 
as follows. 


For each Hubble time H~!, a quantum fluctuation with amplitude i. which is 
the Hawking temperature of de Sitter space, is continuously generated on the scale 
of Hubble radius, and it is stretched by subsequent cosmic expansion to form the 
long wavelength fluctuations. 


Finally as a preparation for the calculations of curvature and tensor perturba- 
tions, let us write down the action of a massive scalar field y using conformal time 
with the line element ds? = a?(n)(—dy? + dx’). 


1 1 
8 = [ v=aate |—591"9,.00,0— 5mo?| 


= ; ; dnd*x {a?[¢ — (Ve)?] — a*m?¢?}, (69) 


where a prime denotes differentiation with respect to conformal time. Introducing 
a rescaled variable y = ad, we find 


v= 5 f ands x” — (Vx)? -— (cen? — =) | (70) 


which is equivalent to the action of a scalar field with a time-dependent mass term 
in Minkowski space. 
In de Sitter space with a(n) = —1/(Hn), the mode function satisfies 


2 
" 2 
XE +R Xk — sx = 0,7 71 
k (—n)? ( ) 


where we have taken m = 0. Its solution may be expressed as 


ele) = (-B)" (mn) = SY (72) 


in agreement with (64) with the normalization condition x'y* — xx" = i equivalent 
to (62). 


7. Cosmological Perturbation 


We now superimpose linearized metric perturbations onto the spatially flat homo- 
geneous and isotropic background spacetime 


de? =<dt? + a(tyrdz’, (73) 
obtaining the perturbed spacetime 


ds* = —(1 + 2A)dt? — 2aBjdtda! + a7(6;; + 2H16;; + 2H7;;)de'dx?, — (74) 
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where all new variables are functions of time and position®’ ®° with Tr Ar,; =0. As 
is well known, a spatial vector B; can be decomposed to rotation-free component 
and divergence-free component as 


B; = 0;B + B;, where 0; B; = 0. (75) 


Similarly, the spatial tensor Hy;; can be decomposed as 


045 oe TT 
Ari; = (a0, = iv?) Hr + 0,7; + O;Hri + Arri;, 
where 0;Hp; =0, O;Hrr*;=0, Hrr’; =0. (76) 


Here Hrr,; is the transverse-traceless component corresponding to gravitational 
waves. Thus perturbation variables are classified into scalar, vector, and tensor 
variables according to the spatial transformation law. Variables of different types 
are not mixed in linear perturbation theory and we can treat them independently. 
Since the vector perturbation has only a decaying mode under normal circumstances 
in the linear perturbation theory,®”** we do not consider it hereafter and focus on 
the scalar and tensor perturbations. 

First let us consider scalar perturbations which are related with the density 
and curvature fluctuations, keeping only A,B, Hz, and Hr. In fact, not all of these 
variables are geometrical quantities but gauge modes are included as well. They 
arise from the arbitrariness of the choice of the background spacetime with respect 
to which we define the perturbation variables. One should remember that the real 
physical entity is an inhomogeneous and anisotropic spacetime, and that it may 
be regarded as homogeneous and isotropic only after taking some average whose 
definition is not unique in general. 

To characterize the gauge dependence, let us consider two background space- 
times with coordinates x“ and %", and assume that these two coordinates are 
mutually related by the coordinate transformation 


Bag +6 =e°+T, FP=a'+ort=2'+o'L, (77) 
where 6a" are small quantities whose relative magnitudes are of the same order as 
the perturbation variables. Here T and L are functions of the space and time coor- 
dinates. Under this coordinate transformation, we find that the scalar perturbation 
variables at the same coordinate values in two different backgrounds are related 
according to 
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respectively. Here T and L represent gauge degrees of freedom on scalar pertur- 
bation. In order to avoid their appearance, one may either find combinations of 
perturbation variables that remain unchanged after gauge transformation, or fix 
the gauge completely. For example, if we set Hy = 0, L is fixed. If we further set 


B=0, T is also fixed and there remains no gauge mode. In this case the perturbed 
metric is given by 

ds? = —(1+ 2W)dt? + a?(1+26)da?, (79) 
where we have rewritten A = WV and H,; = © to stress their gauge invariance. This 
is called the longitudinal gauge. 


In order to fix the gauge, one may combine transformation properties of matter 
variables such as a scalar field, which transforms as 


A(t, x) = o(t —T, 2! — OL) = o(t —T) +56 = o(t) —d()T +64, (80) 
that is, 
6 = 66 — OT. (81) 


8. Generation of Curvature Fluctuations in Inflationary 
Cosmology 


Here, we consider the generation of curvature perturbations in single-field inflation 

models that include not only potential-driven slow-roll models but also k-inflation. 

Further generalization to the case of the most general single-field inflation model 

with second-order field equations will be carried out in Sec. 10 based on Ref. 56. 
We start with the action 


a= [aeva [Mens Kix, ¢)|. (82) 


In the spatially flat homogeneous and isotropic spacetime (73) the background 
equations read 


3M2H? =p=2XKx—K, 2M2H+3M2H? =-P=-K, (83) 
together with the scalar field equation of motion 
a i Kx¢ : Kg Px Kx 
L 3Hed 4 232 2 P= _ 4 


In order to derive the second-order action with respect to the curvature perturbation 
in the comoving gauge, FR, it is convenient to use (3 + 1) formalism 


ds? = —N?dt? + hi;(da* + N'dt)(dai + Ndt), hij =a? (t)e?™6;;, (85) 


in which the constraint equations for the perturbation variables are easily 
obtained.®° This coordinate choice still has a gauge degree of freedom with respect 
to time translation by which the scalar field fluctuation is transformed as (81). 
Setting 6¢ = 0 one can fix this gauge freedom. 
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Then writing the action (82) in this gauge, we find 


M2 4 
_ 5 atovan(gR® +2K)+ a jf ievan eye" — E?)+---, 


(86) 
where 
1. 
where terms replaced by --- are all total derivative terms, so they do not affect 


the dynamics of the system. Here R®) is scalar curvature of 3-space and | denotes 
covariant derivative with respect to hj;. 

Since we are concerned with scalar-type fluctuations, we set N = 1+ a and 
N; = 0O;, where a and w are perturbation variables of the same order as R. 
Differentiating the action (86) with respect to N, we find the Hamiltonian constraint 


K K 1 . 
AD 4x —* E,;E% — E*) = 
R M2, MZ, N2 ( ay ) 0, (88) 
which yields 
A 5 1 =a 2 2 
av a R+ta, MabaxXKx+2X*Kxx. (89) 
a 
The momentum constraint obtained by differentiation with respect to N; reads 
db : 
Fa: - 8) = 2Ha,; —2R,=0, (90) 
FE] 


which gives a = R/H. Inserting these relations into the action, we obtain an action 
for R only as 


ey = — ss * (91) 


py ae 7 
So= Mg f ata? 20° aa €H i) I 


a2 


Introducing new variables z = a(2¥)?/H = a(2ey)2/cs, v = Ma2zR and using 
conformal time, we find 


1 iW 
So = 5 [ane’x [v2 — (Vv)? + =~], (92) 


which like (70) is an action of a free scalar field with a time-dependent mass term 
Zl! 
2 


= (ot? [e-em ae B) (04) — 4H 


= (aH)*(2 +4), (93) 
€H Cs 
VA = Hex’ _ He,’ 


with the important difference that the “sound velocity” c, may not be identical to 
unity. Still, one can quantize v in the same way as a free scalar field if the time 
variation of c, is negligible. q is a small number consisting of slow-roll parameters. 
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Using the de Sitter scale factor a = —1/(H7) one can obtain the mode function 
in the same way as in Eq. (72), so that 


ve = (—9E)" HS (—kean) = 


1 1 a qotkeet 
4 / 2kcs kesn 


which approaches to a constant in the long wave limit. 
Thus the power spectrum of curvature perturbations multiplied by the phase 
space density reads 
2 7 H? 
- 87? M2csen 


Ark? Ark? 
(27)8 (27)8 


Since each quantity in the right-hand side has a weak time dependence, it is evalu- 
ated at the time k-mode left the sound horizon when |kc,7| = 1 was first satisfied. 
The validity of this approximation relies on the fact that the mode amplitude 
remains constant after the mode has exited the horizon. 

The spectral index of curvature perturbation is defined and given by 


_ din Pr(k) 
ns -1l= diae 2eH — NH — S. (96) 
In case the scalar field has a canonical kinetic term as in the standard theory of 


particle physics, we find c, = 1 and ey = 6? /(2M2,H?). Thus we find 


ran =(25) = (8) a 


Here 6y = H/(27) is the amplitude of scalar field fluctuation. This equality is 
intuitively understandable since it represents relative fluctuation of local scale factor 


Uk 
Megz 


Prk) = [Rel? = 


(95) 


ag =jv sian”. (98) 
a ) 


where N = Ina. Since this 6N gives appropriate expression for long wave curvature 
fluctuations if we take an appropriate gauge, this 5N method®!°? is often used in 
the literature. 

For slow-roll inflation models with a canonical kinetic term, from (21) and (22) 
we find e7 = ey and ny = —2ny + 4ev, so the spectral index is given by 


ns -l1=—Gey + 2nv. (99) 


Furthermore the scale dependence of the spectral index, which is called the running 
spectral index, is given by 
dns 
dlink 


= l6eyny — 24e) —2€y, Ev = May ves 


These quantities are observationally measured using CMB, as we shall see in Sec. 12. 


(100) 
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9. Tensor Perturbation 


Next we consider tensor perturbations generated during inflation as quantum grav- 
45,93,94 Tn order to derive field equation for the graviton, first we 
superimpose a small metric perturbation h,,, onto Minkowski space background as 
Suv = Nv + hy. The indices of hy, is raised by the Minkowski tensor. We again 
need to calculate the action up to second-order in the perturbed variables, so we 
should use g4” = n¥” — h#¥” + h¥Oh*?. Taking the longitudinal and traceless gauge, 
we find hoo = hoi = 0, AX = hi =0, he, 0, so that the Ricci scalar is given by 


itational waves. 


he 6 1 ax 
R=h¥hh + Sh his = gh hii. (101) 


tsb 


Here the Greek indices runs from 0 to 3 while the Latin indices from 1 to 3. Since 
what we need is a perturbed action in cosmological background with the metric 


ds? = a?(n)[—dn? + (dij + hij)dx*da?] = Gy, ,de"de”. (102) 


We make use of a conformal transformation 9, = 27g, to calculate the desired 
quantity. The Ricci tensor transforms as 


Ruy = Ruy — 20nQ) ye — Guvg?" (MQ) ;70 
+ 2(in zu (ln Dy — guvg?” (In) ,o(In.Q),-. (103) 
Taking Q = a(7), we find the Ricci scalar in the metric (102) is given by 
" / 
R=a” (R+0o - 32 n%n4). (104) 
a a 


Since we wish to calculate properties of quantum gravitational waves generated 
during inflation, let us assume inflation is driven by constant vacuum energy density, 
which is equivalent to a cosmological constant A, and calculate the second-order 
action for hj;, which reads 


MB f _, 
Sor = (R —2A)./—gd*a 


second-order 
M2 i 4 Se 
= Me / dnd?ca? (hi hd — ni he"), (105) 


where we have used Aa* = 2aa” — a”. 
This action has the same form as a free scalar field again, so we can quantize it in 
the same way as before. Indeed taking new variables as zp = a/2, uj; = Mezrhi;, 


we find 
1 3 12 2 a” 2 
Sars [ane x lus — (Vuig)" + —uis |, 0b) 


which is indeed of the same form as (70). 
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For exact de Sitter space with e4 = 0 we have a = —1/(Hn), whereas in case 
eH # 0, one can express the scale factor around an arbitrary time 7, using the 
Hubble parameter at that time H,. as 


€ 
typ —— 
1 1 —n \? "7 3 3 
= == . 107 
“ ar (=) ye 21-€ ay) 
Therefore, we find the normalized mode function 
i 
TN 2 
uy = (-) Hy) (—kneG(k), A=+,%, (108) 


where ef (k) is a polarization tensor that satisfies (ef (k)e*7? (k)) = 64%. 
Thus, the power spectrum of long-wave tensor perturbation is given by 


= AnH 5 yet Se a _ (109) 
~ (2r)3 Qn)? M22, n? M2, 
with the spectral index 


Pr(k) 


— dinPr(k) _ 
=a 2€H. (110) 
Finally the ratio of this power spectrum to the curvature perturbation, 
Pr(k) 
= = l6Oc,€H = —8Cs 111 
r Prk) Cs€H CsMt (111) 


is called the tensor-to-scalar ratio. It is one of the most important quantity subject 
to ongoing and future observational tests of inflation. In Sec. 11, we present the 
relation between r and n; in more generic theories. 


10. The Most General Single-Field Inflation 


So far we have considered a rather limited class of inflationary universe models 
including only potential-driven slow-roll inflation and k-inflation. But theories con- 
sisting of tensor and a single scalar degrees of freedom have much more functional 
degrees of freedom under the conditions that they are ghost free and stable. In fact, 
the most general single-field action with gravity which yields second-order field 
equations was formulated by Horndeski®® back in 1974. A theory constructed with 
the same objective but totally different appearance was also proposed by covari- 
antizing theories with Galilean invariance®® 0,,¢ — 0,¢ + b, and referred to as 
generalized Galileon.®’ The equivalence of two theories was shown in Ref. 56. 

Here, making use of this theory, we provide a comprehensive and thorough study 
of the most general noncanonical and nonminimally coupled single-field inflation 
models yielding second-order field equations based on Ref. 56, which is called gen- 
eralized G-inflation since it is a natural and comprehensive extension of G-inflation 
model,®° which was the first inflation model using Galileon-like higher derivative 
theory. 
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We use the modern notation of Ref. 97, which contains four arbitrary functions 


of @ and the canonical kinetic function X = —0,,60"@/2 as 
Ly = K(¢,X), (112) 
L3 = —Gs(¢, X) 0, (113) 
La = Gad, X)R + Gax(¢, X)[(6)? — (VuVi9)"], (114) 


Ls = Gs (¢, XG awV"V" o 


— 5G5x (6, X)[(O9)* - 3047.9)? +2(0,¥.8)9}, (115) 


where G,,, is the Einstein tensor, (V,V.¢)? = (Vi Vi¢)(V4AV"¢), (ViVid)? = 
(Vi.Vid)(V“V>¢)(V,V4¢), and Gix = OG; /OX. 

The Lagrangian of the Einstein gravity may be automatically included by taking 
G4 = M2,/2, and the extension to a nonminimally coupled scalar field is also 
immediate. Thus, the total action we consider is given by 


5 
S= > f atey=se, (116) 
1=2 


which is the most general single scalar theory resulting in equations of motion 
containing derivatives up to second-order. Then the standard slow-roll inflation 
can be achieved by K = X — V[@] with all the other terms except for the above- 
mentioned Einstein term equal to zero. It can also realize other inflation models 
easily; k-inflation® is realized by K(¢,X), G-inflation®® by K and G3 terms, and 
even New Higgs inflation,®® which couples the Einstein tensor with the scalar kinetic 
term, can be realized taking an appropriate form of G5. We also note that the 
curvature-square inflation!” or more general f(R) inflation can also be recast in the 
present form by defining a new field as ¢ = df/dR. 

Below we describe field equations for the homogeneous background and present 
two examples based on Refs. 56 and 99. 


10.1. Homogeneous background equations 


The background equations of motion can be derived from (116) taking ¢ = d(t) 
and the spacetime metric ds? = —N?(t)dt? + a?(t)da?. Variation with respect to 
N(t) gives the constraint equation corresponding to the Friedmann equation, which 
can be written as 


5 
peer (117) 


where 


Ey =2XKx — K, (118) 
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€3 = 6X bHG3x — 2XGag, (119) 
£4 = —6H?G4 + 24H? X (Gax + XGaxx) —12HX¢bGagx —6HbGag, (120) 
Es = 2H? Xd(5Gsx + 2XGsxx) —6H?X (3Gs¢ + 2X Gs9x). (121) 


Being the constraint equation, the above terms contain derivatives of the metric 
and the scalar field only up to first-order. 

The evolution equation is obtained by differentiation with respect to a(t), which 
reads 


Rae (122) 
i=2 
where 
Py = K, (123) 
P3 = —2X (G4 Eh bG3x), (124) 


Pa = 2(3H? + 2H)G4 — 12H? XGax —4HXGax — 8HXGax — 8HX XGaxx 
+ 2(b + 2Hd)Gag + 4X Gags + 4X (¢ — 2H4)Gugx, (125) 
Ps = —-2X (2H°6 + 2HH¢ + 3H7$)Gsx — 4H? X°6Gsxx 
+4HX(X — HX)Gsox + 2[2(HX) + 3H?X|Gsg + 4HX4Gs6¢- (126) 
Although the background quantities €; and P; are defined in an analogous way in 
which the energy density and the isotropic pressure of a usual scalar field are defined, 
the partition between the gravitational and scalar-field parts of the Lagrangian is 
ambiguous, because the gravitational contributions are treated with the matter 
contributions on an equal footing in the above expressions. Indeed, one can see 
that, for Gy = M2./2, €, and Py reduce to the Einstein tensor multiplied by —1: 
E, = —3M2.H? and Py = M2(3H? + 2H). 
Finally, variation with respect to ¢(t) gives the scalar-field equation of motion, 
=< (a J) = Py, (127) 
where 
J = 6Kx + 6HXG3x — 26G3y + 6H?d(Gax + 2X Gaxx) — 12HX Gagx 
+ 2H°X (3G5x + 2XGsxx) — 6H?b(Gsg + XGsox) (128) 
and 
Ps = Kg — 2X (Geo + 6G3gx) + 6(2H? + H)Gag + 6H(X + 2HX)Gagx 
— 6H? XGsgo + 2H°X bGsox- (129) 


The solutions of the above equations admitting accelerated cosmic expansion 
constitute generalized G-inflation. We present two examples below. 
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10.2. Kinetically driven G-inflation 


If we impose shift symmetry, 6 — @+ ¢, the scalar field has no potential. In this 
case, the field equations are 


3 & = oJ — K —6H?(Gs — 2XGix) +40 XdGsx =0, (130) 
1=2 
5 
YUE: + Pi) = bs ~ 2XbGax + 252M (Gi — 2XGix) ~ PXSGsx] = 0, 
_ (131) 
S08) = 0, (132) 


From Eq. (132), one immediately finds that J « a~* — 0. Shift-symmetric models 
thus have an attractor, J = 0, along which H = const., ¢ = const., satisfying 


oKx +6HXG3x + 6H7d(Gax + 2XGaxx) 
+ 2H®X (8Gsx + 2XGsxx) =0, (133) 
K + 6H?(Ga — 2XGax) — 4H°X¢Gsx =0. (134) 


Provided that Eqs. (133) and (134) have a nontrivial root, H 4 0, ¢ 4 0, we obtain 
inflation driven by ¢’s kinetic energy. 
As a simple example, let us take 
Xx? x 
k=-X+2—, G3;=—, 
a5 2M3 yn ’ 3 ive ’ 
with M and p being constants with unit mass dimension. Writing the field equations 
for this Lagrangian, we can easily find a de Sitter solution with a constant ¢, 


X = M3 uz, 


(135) 


2 (136) 
yx M3 (1-2) 


a ae 
where x (0 < x < 1) is a constant satisfying (1 — x)/x,/1 — 2/2 = V6u/Mc. For 
uu < Me, it can be seen that x ~ 1 — V3u/Me and hence the Hubble rate during 
inflation is given in terms of M and p as H? ~ M%/(6MZ). We can show that 
this inflationary solution is an attractor of the theory. 

Inflation can end if the shift symmetry is broken at some ¢ where the first term 
of kK changes sign. Then the universe will soon become dominated by the kinetic 
energy of the scalar field, which decreases in proportion to a~® after the higher- 
order term in K becomes negligible compared with the first term. Since the shift 
symmetry of the original Lagrangian prevents direct interaction between @ and 
standard-model sector, reheating proceeds only through gravitational particle pro- 
duction.”*-”9 The reheating temperature in such a case is given by (58). 
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10.3. Potential-driven slow-roll G-infiation 


Next we consider the opposite extreme. Suppose that the functions in the 
Lagrangian can be expanded in terms of X as 


K(9,X) = -V(@) +K(@)X +---, (137) 
Gi(b, X) = gil) + hil O)X +-°-, (138) 


and consider the case in which the inflaton field value ¢(¢) changes very slowly. In 
this case, the potential term manifestly breaks the shift symmetry, so that the model 
is capable of a graceful exit from inflation.!°° Neglecting all the terms multiplied 
by ¢ in the gravitational field equations, we obtain 


5 5 
DOP ~~ DE ~ -V(d) + Gaal d)H?, (139) 


where we have assumed 
|H|<H? and |d|<|HdI. (140) 


We may thus have slow-roll inflation with 


pa”. (141) 
694 
During slow-roll, we approximate 
|\J|<<|HJ|, |Gi|<«|Hgil, |hil < |Hhil. (142) 


Under the above approximation, we have the slow-roll equation of motion for ¢, 
3HJ ~ —V5 + 12H? Q1g, (143) 
with 
J ~ Ko — 29366 + 6(Hh3X + H?had — H? 9596 + H°h5X). (144) 


Which term is dominant in Eq. (144) depends on the magnitude of the coefficients 
hi(@) of X. Note here that we can set g3 = 0 and gs = 0 without loss of generality, 
because g34 can be absorbed into a redefinition of K and gs5g into ha, that is, 
K- 2936 = K, ha — 95¢ 7 ha. 

If we take K = 1 and gy = M2/2 with all the other coefficients equal to zero, 
we recover the standard slow-roll inflation in the Einstein gravity driven by the 
potential V(¢). Identifying @ with the real neutral component of the standard 
Higgs field, we can reproduce various types of Higgs inflation by taking each of the 
remaining functions as follows: 


K=1+«¢?", running kinetic inflation,’°! (145) 
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od 


h3(¢) = Me Higgs G-inflation,‘° 146) 

_ Me 3 2 riginal Hig inflati 103-105 147 

ga() = : 5? , original Higgs inflation, ) 
1 

ha(d) = Be new Higgs inflation,°® 148) 

hs(o) = 7 running Einstein inflation.°? 149) 


Thus all types of Higgs inflation models can be expressed in the framework of 
generalized G-inflation. Furthermore, combinations of these models can also be 
analyzed coherently in this context.°? 


11. Power Spectrum of Perturbations in Generalized G-inflation 


Next we incorporate scalar and tensor perturbations into the homogeneous solution 
again based on Ref. 56. To do this we consider the perturbed metric 


ds? = —N?dt? + 4; (da* + N*dt)(da? + Nidt), (150) 
in the unitary gauge ¢ = ¢(t), where 


1 
N=l+a, N=O8, yj = a?(t)e*S (5: + hij + shina; (151) 


Here a, 3, and ¢ are scalar perturbations and h,; is a tensor perturbation satisfying 
hig = 0 = Ayj,7- With the above definition of the perturbed metric, \/—g does not 
contain h;; up to second-order, and the coefficients of ¢? and a¢ vanish, thanks to 
the background equations. 


11.1. Tensor perturbations 


The quadratic action for the tensor perturbations is found to be 


gO) = : fl dtd? a? lorie, - FF whis)? (152) 
where 
Fr = 2[Gs — X(¢Gsx + Gse)], (153) 
Gr = 2[Gs — 2XGax — X(HOGsx — Gs¢)]. (154) 
One may notice that Gr can also be expressed as 
1a; 
Or =; d, we (155) 


The squared sound speed is given by 


a=. (156) 
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One sees from the action (152) that ghost and gradient instabilities are avoided 
provided that 


Note that c#, is not necessarily unity in the general case, contrary to the standard 
inflation models. 
To canonically normalize the tensor perturbation, we define 


Cc a 
dyr = —at, a= g(FrGr), Vig = erhaj, (158) 
and then the quadratic action is written as 
(2) _ 1 3 PAD ANG Ea 2 
3) = = | dyrddx | (vf)? — (Voig)? + 203), (159) 
2 J 27 d 


where a prime denotes differentiation with respect to yr. In terms of the Fourier 
wavenumber k, sound horizon crossing occurs when k? = z/,/zr ~ 1/y for each 
mode. 
On superhorizon scales, the two independent solutions to the perturbation equa- 
tion that follows from the action (159) are 
d 
Vig X 2p and zr us (160) 
ay 
In terms of the original variables, the two independent solutions on superhorizon 
scales are given by 


© dt’ 
hij = const. and / Bon (161) 
The second solution corresponds to a decaying mode. 
To evaluate the primordial power spectrum, let us assume that « = —H/H? ~ 
const., 
Fr ‘4 
= ~const. and = — ~ const. 162 
fr=F Fi I = FG" (162) 


We also define the variation parameter of the sound velocity of tensor perturbations 
as 
— A CT 


il 
sr = Ge = 5 (fr - 97). (163) 
Clearly only two of the three parameters are independent. We additionally impose 
conditions 


fr. 9° 
l—-e—-—+4+— 164 
> 5 > 4, (164) 


3-—e+gr>0. (165) 
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The former (equivalent to 1 > €+s7) guarantees that the time coordinate yr runs 
from —oo to 0 as the universe expands. The latter implies that the second solution 
in (161) indeed decays. We see that zp can be written as 


(—yrx) 


(166) 


where the quantities with « are those evaluated at some reference time yr = yr. 
The normalized mode solution to the perturbation equation is given in terms of the 
Hankel function: 


Tv 
vij = VE GH) (—kur) ex, (167) 


where 


_ 3—-—e+9r 
2-2 -—frt+gr’ 


VT (168) 


and e;; is a polarization tensor. Notice that the conditions (164) and (165) guarantee 
the positivity of vr. On superhorizon scales, —kyr < 1, we obtain 


T a 1/2-vr 
k3/2h,, we gur-2 (vr) ( yr) k3/2-Y7 @, (169) 
r(2 a 
2 
Thus, we find the power spectrum of the primordial tensor perturbation: 


ee 
FR? Arr? 


; (170) 
—kyr=1 


Pr = 8yr 


where yr = 2?"7~3|['(vr)/T(3/2)|2(1 — € — fr/2 + gr/2). The tensor spectral tilt 
is given by 


nT = 3- 2p. (171) 


Contrary to the predictions of the conventional inflation models, a blue spectrum 
np > 0 can be obtained if the following condition is satisfied: 


4e+3fr—9r <0. (172) 


This condition is easily compatible with conditions (164) and (165). Thus, positive 
and large gr compared with ¢ and fr can lead to a blue spectrum of tensor per- 
turbations. In deriving the above formulas, we only assumed that ¢, fr, and gr 
are constant. These parameters may not necessarily be very small as long as the 
inequalities (164) and (165) are satisfied under a sensible background solution. 
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11.2. Scalar perturbations 


We now focus on scalar fluctuations setting hij = 0. Plugging the perturbed metric 
into the action and expanding it to second-order, we obtain 


Ss? = / dnaa® | 3Gr¢? 4 = (VO)? + Ba? 


2 v2 ; 2 
200.58 | 2Gn¢~ 3+ 60a¢ at ; (173) 
a a a 


where 
Wa Xho + OO Rye + AEX Gae 
+ 6HbX°G3xx — 2X G34 — 2X°G3ex — 6H?Gs 


+ 6[H?(7X Gax + 16X?Gaxx +4X°Gixxx) 
_ Hd(Gag + 5X Gagx + 2X°Gagxx)| 
+ 30H°dXGsx + 26H°A6X?Gsxx 


+4H°bX°Gsxxx — 6H?X (6Gsg + 9XGsox + 2X?Gsexx), (174) 
O= —$XGsx + 2HG, — 8HXG4x —- SHX’Gaxx + bGag + 2X bGaex 
— H?$(5X Gsx +2X?Gsxx) + 2HX (3Gs¢ + 2X Gsox). (175) 


It is interesting to see that even in the most generic case, some of the coefficients 
are given by Fr and Gr, i.e. the functions characterizing the tensor perturbation, 
and only two new functions show up in the scalar quadratic action. Note that the 
following relations hold: 


5 5 
be; 1 OE; 
= ; 1 
y x) OX 34 DoH (176) 
5 
ive 
__! 177 
iS) 6 2 OH’ (177) 


which compactify the above lengthy expressions. 
Varying the action (173) with respect to a and (, we obtain the constraint 
equations 


2 2 
Da e. B+30¢ —Gr te ¢=0, (178) 
a a 
Oa —Gré = 0. (179) 


Using the constraint equations, we eliminate a and (3 from the action (173) and 
finally arrive at 


Ss = i dtd? a? asc? ~ Fs ge, (180) 


a2 
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where 
_ld/a_, 
Fs = cay (6%) ~ Fr oe 
X 2 
Gs = SG} + 3Gr. (182) 


The analysis of the curvature perturbation hereafter is completely parallel to 
that of the tensor perturbation. The squared sound speed is given by c2, = Fs/Gs, 
and ghost and gradient instabilities are avoided as long as 


Fs>0, Gs>0. (183) 


In the case of standard potential driven inflation and k-inflation, where G3 = 0 = 
Gs and G4 = M2./2, we find Fs = Mée. This means that positive His prohibited 
by the stability requirement.!°° In the generalized G-inflation, however, we may 
realize positive H without violating the stability condition. This point was already 
clear in G-inflation and kinetic gravity braiding for which G3 4 0.°°:1°” Stable 
cosmological solutions with H > 0 offer a radical and very interesting scenario of 
the earliest universe.!0% 11 


Using the new variables 
dys = Fat, 25 = V2a(FsGs)"/", w= 256, (184) 


the curvature perturbation is canonically normalized and the action is now given 
by 


1 if 
s? =. J eusate Ga — (Vu)? + 07, (185) 


where a prime denotes differentiation with respect to ys. Each perturbation mode 
exits the sound horizon when k? = z{/zg ~ 1/y2, where k is the Fourier wavenum- 
ber. 


The two independent solutions on superhorizon scales are 


dt’ 


aes (186) 


t 
¢=const. and / 
During inflation, it may be assumed that Gg is slowly varying. In this case, the sec- 
ond solution decays rapidly. Note however that the nontrivial dynamics of the scalar 
field can induce a temporal rapid evolution of Gg, which would affect the super- 
horizon behavior of the curvature perturbation through the contamination of the 
second mode in the same way as in Refs. 112-115. Given the specific background 
dynamics, one can evaluate such an effect using our general formulas. 
Closely following the procedure above for the tensor perturbations, we now 
evaluate the power spectrum of the primordial curvature perturbations. We assume 


II-260 K. Sato and J. Yokoyama 


that € ~ const.,* 


fs= Hfs ~const., gs = aes ~ const. (187) 
and then define 
3—e+ 98 
ve => ————_——, 188 
5 2 e— fabs veel 
The power spectrum is given by 
1/2 779 
_%s Gg HH” 
PES i gielee F (189) 
Ss —kys=1 


where yg = 2?"8~3|[(vg)/T'(3/2)|?(1 — € — fs/2 + 9s/2). The spectral index is 
Ns —1=3-2v¢. (190) 


An exactly scale-invariant spectrum is obtained if 
3 1 
—fg——gs = 0. 191 
e+ 7fs— 498 (191) 


Here again, €, fs, and gg are not necessarily very small (as long as n, — 1 ~ 0). 
Taking now the limit ¢, fr, gr, fs,gs <1, the tensor-to-scalar ratio is given by 


Fg\?? (s) Fg tg 
r=16| — a = 16— —. 192 

(#2) Gr Fr cr ne) 
In the case of potential-driven slow-roll inflation considered in Sec. 10.3, we need 


have to leading order in slow-roll 


xX AbX 


Fg ~ Fa(K + 6H*ha) + (hs + H7hs), (193) 
x 6dX 
Go wk + 6H7h4) (h3 + H7hs) (194) 


and Fr ~ Gr ~ 2g4, where we used the slow-roll equation 2g4¢ + g4/H ~ oJ/2H?. 
In this case, we have c7, ~ land np ~ —(2€+gr) with fr ~ gr ~ ga/ (Hoga). If the K 
or hg term dominates in J, we have c2 ~ 1 and Fs ~ Gs ~ Jo/(2H?) ~ ga(2e+gr), 
which yields the standard consistency relation: 


r~—8nr. (195) 


It is often argued that a negative np is a generic prediction of inflationary 
cosmology, and that if observationally a positive np was confirmed, it would falsify 
inflation. It is clear from the above discussion that such a statement is incorrect. 
The signature of tensor spectral index is not a discriminant of inflation and exotic 
cosmology but simply that of standard inflation and generalized G-inflation. 


@By defining the variation parameter of the sound velocity of scalar perturbations as sg = 
és/(Hes) = (fs — gs)/2, the formulae with fg and/or gg can be rewritten in terms of cg. 
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12. Inflationary Cosmology and Observations 


Various predictions of inflationary cosmology are now confronting observational 
tests using CMB anisotropy and large-scale structures. Since the first-year WMAP 
results were disclosed,°?!!6"1” the cosmological parameters of homogeneous and 
isotropic universe as well as properties of fluctuations have been determined quite 
accurately with well chaaracterized error bars. Indeed, until the WMAP papers were 
published, although we had the concordance cosmology, we did not have any reliable 
error estimates because the concordance values of the cosmological parameters had 
been obtained by combining a number of different observational results and we did 
not have means to calculate error bars to make a sensible weighted average. 

Current precision cosmology makes use of the Markov Chain Monte Carlo 
(MCMC) method to constrain values of cosmological parameters as well as the 
amplitude and spectrum of primordial fluctuations using temperature and E-mode 
polarization data of CMB.!!® Note, however, that the final values of these param- 
eters as well as the magnitude of errors obtained by these procedures change con- 
siderably depending on which observational datasets one uses as well as which 
parameters are included in the model. 

Recent cosmological data as a whole show good agreement with spatially flat 
universe with a nonvanishing cosmological constant (A) and cold dark matter 
(CDM) based on simple single-field inflation paradigm that predicts almost scale- 
invariant adiabatic fluctuations. Thus it makes sense to estimate various parameters 
of ACDM model using MCMC techniques. 

Below we quote results from the seven- and nine-year WMAP!!®:1!9 denoted 
respectively by W7 and W9 as well as Planck 2013°%1?° (P13) and 2015121122 
(P15) results to give an idea about what constraints are likely to survive in future 
independent of the subtleties of each experiment and accumulation of the data. 
Abbreviations in parentheses after each quoted number refers to the dataset used, 
whose meaning is self-explanatory. First as the most important test of the global 
spacetime, the spatial curvature is constrained as 


0.0133 < Axo = — ae < 0.0084 (W7HSTBAO 95%CL), 
0.079 < Axo < 0.007 (W9 68%CL), 
0.0065 < Xx < 0.0012 (W9eCMBBAO Hp 68%CL), 
~0.09 < Xxo < 0.001 (P13WPhighL 95%CL), oe 
0.0075 < Axo < 0.0052 (P13lensWPhighLBAO 95%CL), 
~0.019 < Axo < 0.011 (PISTTTEEElens 95%CL), 
0.0039 < Axo < 0.0048 (PISTTTEEElensExt 95%CL), 


which is consistent with (9). Hence from now on we fix Oxo = 0. 
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The amplitude and the spectral index of curvature perturbations measured by 


WMAP read 
Pr(ko) = (2.48340.11)x10-°, n,=0.967+ 


Pr(ko) = (2.430 + 0.091) x 107°, ns = 0.968 + 


0.014 (W7 68%CL), 


0.012 (W7HSTBAO 68%CL), 
(197) 


Pr(ko) = (2.4140.10) x 1079, n, =0.972+0.013 (W9 68%CL), 


Pr(ko) = (2.42770:078) x 1079, ns =0.971+0.010 (W9BAOH) 68%CL), 


(198) 


where the pivot scale is taken as ky = 0.002 Mpc7!. 
Planck, on the other hand, takes the pivot scale at kj = 0.05 Mpc~! and finds 


Pr(ko) = (2.19579 js) x 107°, ns = 0.9603 + 0.0073 (P13WP 68%CL), 


Pr(ko) = (2.20010 '929) x 107°, ns = 0.9608 + 0.0054 


(P13WPhighLBAO 68%CL), 


(199) 


Pr(ko) = (2.20710578) x 107°, ns = 0.9645 + 0.0049 


(PI5TTTEEElowP 68%CL) 


d 


Pr(ko) = (2.20440-078) x 107-9, ns = 0.9673 + 0.0045 


(P15TTlowPBAO 68%CL). 


(200) 


If we allow scale-dependence of the spectral index, the values change considerably as 


dns 
Me = 1.02742-089 “s_ — _0.03440.026 (W7 68%CL), 
dink 
(201) 
Ns = 1.008 + 0.042, ak = —0.022+0.020 (W7HSTBAO 68%CL), 
ns = 1.009 + 0.049, dns = —0.019+0.025 (W9 68%CL), 
dlnk 
dns 
ns = 1.020 + 0.029, a = —0.023+0.011 (W9eCMBBAOHp) 68%CL). 
(202) 
These numbers refer to the values at kg = 0.002 Mpc~!. Planck finds 
dns 
“s — 0.0134 0.009 (P13WP 68%CL), 
dlink 
4 (203) 
“s — 0.0114 0.008 (P13lensWPhighL 68%CL), 


dink | 
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dns 
“s_ — —0,0057 + 0.0071 (P1I5TTTEEElowP 68%CL), 
dink 
(204) 
“s_ — —0.0033 + 0.0074 (P15lowPlens 68%CL), 
dink 
on their pivot scale kj. Thus there is no need to consider inflation models with 


appreciable running any more.!2? 175 


The amplitude of tensor perturbations, which tells us the energy scale of infla- 
tion, is also an important quantity to distinguish inflation models. If we incorporate 
tensor perturbations in power-law ACDM model the constraints are 


r<0.36 (W7 95%; no running), 

r<0.24 (W7BAOH) 95%; no running), 

r<0.38 (W9 95%; no running), 

r<0.13  (W9eCMBBAOH) 95%; no running), (205) 
r<0.11 (P13WPhighL 95%; no running), 

r<0.11 (P15TTlowPlens 95%; no running), 

r<0.10 (PI5TTTEEElowP 95%; no running), 


using the pivot scale ky = 0.002 Mpc’ 


index the above constraint relax substantially to 


, whereas if we allow a running spectral 


r<0.49 (W7 95%; with running), 

r<0.49 (W7BAOA) 95%; with running), 

r<0.50 (W9 95%; with running), 

r<0.47 (W9eCMBBAOH) 95%; with running), (206) 
r<0.26 (P13WPhighL 95%; with running), 


r<0.18 (P15TTlowP 95%; with running), 
r<0.15 (PIS5TTTEEElowP 95%; with running). 


Thus the above constraints change significantly if we allow running spectral index 
of curvature perturbation or not, since they are indirectly obtained through the 
temperature anisotropy and the E-mode polarization. 

Because of this sensitivity to the nobel assumptions, it is important to constrain 
r directly through B-mode polarization of CMB, which is not generated by scalar 
perturbations so that the result does not rely on assumptions concerning the spec- 
trum of curvature perturbations. In March 2014, BICEP2 collaboration announced 
possible detection of primordial tensor perturbations through their observation of 
B-mode polarization of CMB at the South Pole,!*° favoring a fairly large value of 
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r ~ 0.2, namely high-scale inflation. It turned out, however, that their result was 
significantly contaminated by the galactic dust and large r is no longer favored. The 
joint analysis of BICEP2/Keck Array data and Planck 2015 data was performed 
obtaining the upper bound, ro.95 < 0.12 at 95% CL.!?7 Thus the current constraint 
from B-mode polarization is at the same level as those obtained indirectly from 
temperature anisotropy and E-mode polarization with the power-law model. 

These constraints are often graphically displayed in a (ns,7r) plane!?® 
and two-sigma contours and compared with various inflation models. We do not 
do the same here for two reasons. First constraints vary appreciably depending on 
what dataset one uses and what underlying model one adopts as seen above. Second 
occupying center of the likelihood contour at the current level of observations does 
not necessarily mean that such a model is the right one, and we should be open 
minded for further development of the observations. 

Below we instead calculate these observables in each class of models discussed 
in Sec. 4. 


with one- 


12.1. Large-field models 


Consider a massive scalar model as an example of large-field models. The number 
of e-folds of inflation after ¢ crossed @y is given from (31) as 


l 2 
N=I (=) (207) 
Hence the slow-roll parameters are 
Me\*? 1 
ev =m =2( 28) = ON’ é&v =0 (208) 


which gives the amplitude of curvature perturbation, (95), 
et ea ; (209) 
RY 6n2 \ Me ) 


Setting N = 55 at the pivot scale tentatively, we find m = 1.6 x 10! GeV and the 
quartic coupling is constrained as \ < 8 x 107!°. 
The spectral index and its running are given by 


ii oo, ™ : 


- _2 —4 
i anh ye = OKI, (210) 


in agreement with the observations. The amplitude of tensor perturbation, r = 
16€ = 0.15, is so large that it is now strongly disfavored by observations.!?” It is 
remarkable that the simplest and most attractive model of inflation is now about 
to be ruled out by the observation. 


Inflationary cosmology: first 30+ years TI-265 


Let us also consider R? inflation in the Einstein frame as a kind of large-field 
models. From (23), (24) and (27), we find 


3 z. 4 1 3 
N®& rev aren = ee 
4° ad 3 (eV anon 1)2 4N2’ 
4 eVaren 9 1 211) 
nv = = 


which yield 
2 
Ms = 1— 5 = 0.964, r= 5 =40x107, (212) 


for N = 55. 


12.2. Small-field model 
Next we consider a small field model (32). From V[¢] = 4(¢? — v?)? the slow-roll 


parameters are given by 
8M2.¢7 _ 4M2(3¢? — v?) —_ 96MA¢? 
Goa aa aa 


Since inflation ends when ez; = ey = 1, the field value, gy, at that time is given by 


oF =v" +4MZ — \/16M2, + 8M2v? & (8? — 2V28) MG, (214) 


where we have set v = GMg and the last approximate equality holds when 3 => 10. 
The number for e-folds from ¢ = dy to of reads 


dp 6. of op 1 
= PB ier aM gn 2 


The last approximation is valid when @ ~ 10 — 20. The amplitude of curvature 
perturbation is given by 


Moy —v?)* ABS (Me BF BN 
Pr(ko) = 76872MS¢2, ~ AX (Me) 76872(3 — /2)? ,00(B +1), 


ev= (213) 


oR) =F (InSL 5). cars) 


(216) 


where the second and third approximations are based on dy < v and ¢¢ & (8 — 
V2)Me, respectively. The spectral index and its running are given by 


8(3¢n +u7)MZ 8 


ng 1 =o =e (217) 
dn, _ (3200242, + 19244) M4 320 ( on I vig 
dink — (2, — v?)4 — 68 \ Me a, 
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For @ = 15 and N = 55, for example, we find n, = 0.964 with negligibly small 
running, and the self-coupling is given by \ ~ 7 x 1074. 


12.3. Hybrid inflation model 


Here, we consider the case where vacuum energy drives inflation, which is distinct 
from the above two classes. The potential in this regime is approximated as 


Ts 
V[d] = Vo + ee (219) 
and so the slow-roll parameters are given by 
1 ¢m\4/( @ m 
= — a ———— SS — - 22 
eS as & (3%) » WV = sr bv =0 a) 


In this case one cannot determine the energy scale of inflation from the amplitude 
of curvature fluctuations alone. The spectral index is usually determined by ny as 


2m? 
which is inconsistent with current observations. The running spectral index takes 
a small value 


dn am?\? / o \? ¢\? 
ae Ti =| | (ng TP | 22) 
dink VV (Fr) (5) 7) (35) aa) 
so does the tensor-to-scalar ratio, 
2m? : Q a 2 Q ° 
= 2 (| —~ —} ~(n,-1 — |}. 22 
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One should note that if the second term dominates over the vacuum energy in 
(219), the prediction becomes closer to a large-field model and the spectral index 
can be smaller than unity. 


12.4. Noncanonical models and multi-field models 


So far we have focused on the amplitude and spectral index of primordial curvature 
and tensor perturbations. Indeed in single-field slow-roll inflation there is only one 
fluctuating degree of freedom, so only adiabatic fluctuations are generated with no 
isocurvature counterparts. Furthermore, the curvature perturbation generated in 
slow-roll inflation behaves as a practically free massless scalar field during infla- 
tion, and so its vacuum fluctuation is Gaussian distributed. This is why the power 
spectrum fully quantifies the properties of fluctuations. 

In some classes of noncanonical inflation models including DBI inflation,!?98° 
ghost inflation,!3! k-inflation®4 and G-inflation,®® the inflaton may change rapidly 
and its interaction may not be negligible. In such circumstances deviation from 
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Gaussian statistics may be significant.20-!82 135 Furthermore, in models with multi- 
ple fluctuating fields besides the inflaton, one may also expect large deviation from 
Gaussianity. 136 188 

Much work was done in recent years to quantify the skewness or the bispectrum 
of the temperature anisotropy, and a number of contrived models were proposed 
that predicted an observationally falsifiable magnitude of the deviation from Gaus- 
sianity by Planck.!8° The constraints on non-Gaussianity provided by analyzing the 
Planck data!?° falsified most of these models that had been proposed predicting 
large bispectral non-Gaussianity; therefore, we do not discuss this issue in more 
detail. 

Another possible signature of multi-field models would be the presence of isocur- 
vature modes. But again CMB observations do not need any isocurvature compo- 
nents at least on relevant scales.119 


13. Conclusion 


The fact that we live in a big and old universe with a large amount of entropy can be 
regarded as a primary consequence of inflation in the early universe. Any attempt 
to avoid inflation by providing alternative explanations to the other predictions of 
inflationary cosmology, namely, generation of adiabatic density perturbation with 
nearly scale-invariant spectrum as well as tensor perturbation should also have 
sound explanation to the aforementioned global properties of the spacetime. 

Some people, however, try to refute this viewpoint claiming that the global prop- 
erties of the universe, namely the homogeneity, isotropy, and longevity had been 
known long before the first inflation models!?:!®!" were proposed so that they can- 
not be regarded as the predictions of inflationary cosmology. While this statement 
is correct, one should also recognize that theories in physics have a double role. The 
first is to make new prediction which may be confirmed or falsified by experiment 
and observation. The second is to provide sensible explanations to known myster- 
ies of the nature. Inflationary cosmology certainly provides a simple mechanism 
to explain fundamental properties of the global structure of the space, playing the 
latter role of theoretical physics. Thus the above mentioned criticism is unjustified. 

Furthermore, the simplest class of slow-roll inflation models predicted nearly 
scale-invariant, adiabatic, and Gaussian curvature and density fluctuations, all of 
which are now being confirmed by dedicated CMB experiments. In particular, the 
observed angular correlation between temperature anisotropy and E-mode polariza- 
tion clearly indicates that the observed fluctuations were generated as super-Hubble 
fluctuations as predicted by inflation. 

We are tempted to interpret the fact that Planck did not confirm any sizable 
non-Gaussianity!*° despite the recent efforts of many theorists to make models of 
non-Gaussian fluctuations as indicating that nature preferred the simplest inflation 
models. This situation, on the other hand, highlights the difficulty of singling out 
the right model responsible for our universe by exploiting the small numbers of 
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observational clues available. For example, among the models occupying central 
regions of the likelihood contours in (n,,r) plane now, it is very difficult to dis- 
tinguish between R? model!”!41:142 and a Higgs inflation model with a large and 
negative nonminimal coupling using observations of perturbation variables 
alone. We should combine with the analysis of the reheating processes of these mod- 
els for more detailed comparison, which may be observationally probed by future 
space-based laser interferometers such as DECIGO.!43:144 

On the other hand, the latest observational upper limits on B-mode polarization 
of CMB is ruling out the simplest inflation model, namely, chaotic inflation with a 
massive scalar field.!?’ This is somehow a shocking result since we also have sensible 
particle physics models for this inflation mechanism. This may indicate that our 


103-105 


understanding of the gravity sector at such high energy scale may be wrong, as 
there is no reason believe that the Planck scale remains intact up to the Planck 
scale. 
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At its very beginning, the universe is believed to have grown exponentially in size via the 
mechanism of inflation. The almost scale-invariant density perturbation spectrum pre- 
dicted by inflation is strongly supported by cosmological observations, in particular the 
cosmic microwave background (MB) radiation. However, the universe’s precise inflation- 
ary scenario remains a profound problem for cosmology and for fundamental physics. 
String theory, the most-studied theory as the final physical theory of nature, should 
provide an answer to this question. Some of the proposals on how inflation is realized 
in string theory are reviewed. Since everything is made of strings, some string loops of 
cosmological sizes are likely to survive in the hot big bang that followed inflation. They 
appear as cosmic strings, which can have intricate properties. Because of the warped 
geometry in flux compactification of the extra spatial dimensions in string theory, some 
of the cosmic strings may have tensions substantially below the Planck or string scale. 
Such strings cluster in a manner similar to dark matter leading to hugely enhanced 
densities. As a result, numerous fossil remnants of the low tension cosmic strings may 
exist within the galaxy. They can be revealed through the optical lensing of background 
stars in the near future and studied in detail through gravitational wave emission. We 
anticipate that these cosmic strings will permit us to address central questions about 
the properties of string theory as well as the birth of our universe. 


Keywords: Inflation; string theory and cosmology; cosmic strings; early universe; gravi- 
tational microlensing. 


PACS Number(s): 98.80.Bp, 98.80.Cq, 98.80.Es, 11.25.—w, 11.27.+d 


1. Introduction 


By 1930, quantum mechanics, general relativity and the Hubble expansion of our 
universe were generally accepted by the scientific community as three triumphs of 
modern physical science. Particles and fields obey quantum mechanical rules and 
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spacetime bends and warps according to Einstein’s classical description of gravity. 
The dynamical arena for particles, fields and spacetime is the universe and modern 
cosmology is born as the application of these physical laws to the universe itself. 

The path forward seemed clear but in the 1950s and 1960s physicists first began 
to appreciate that trying to quantize the classical gravitational field led to severe 
inconsistencies. This nonrenormalizability problem was further sharpened once the 
quantization of gauge (vector) fields was well understood in early 1970s. 

At about the same time, string theory — a quantum mechanical theory of one- 
dimensional objects — was found to contain a graviton (a massless spin-2 particle). 
The promise of a self-consistent theory of quantum gravity has attracted huge 
attention since the 1980s. Tremendous progress has been made in the past 30 years 
in understanding many aspects of string theory; yet today we are no closer to 
knowing which of many theoretical possibilities might describe our universe and we 
still lack experimental evidence that the theory describes nature. 

By now, string theory is such a large topic with so many research directions that 
we have to choose a specific area to discuss. The centenary of general relativity 
prompts us to reflect on two very amazing manifestations of general relativity: 
namely black holes and cosmology. Given the tremendous progress in cosmology 
in the past decades, both in observation and in theory, we shall focus here on the 
intersection of string theory and cosmology. 

Our fundamental understanding of low energy physics is grounded in quantum 
field theory and gravity but if we hope to work at the highest energies then we 
cannot escape the need for an ultraviolet completion to these theories. String the- 
ory provides a suitable framework. The string scale, Ms = 1/ Va! where a’ is the 
Regge slope, is expected to be below the reduced Planck scale Mp; = 1/V8aG ~ 
2.4 x 1018 GeV (where G is the Newton’s constant) but not too far from the grand 
unified scale Maur ~ 101° GeV (where the proton mass is about 1 GeV, in units 
where c = h = 1). Particle physics properties at scales much lower than Ms are hard 
to calculate within our present understanding of string theory. High energy exper- 
iments can reach only multi-TeV scales, orders of magnitude below the expected 
Msg. The difficulty of finding a common arena to compare theory and observation 
is hardly a new dilemma: although standard quantum mechanics works well at and 
above eV energy scales, it is woefully inadequate to describe properties of proteins 
which typically involve milli-eV scales and lower. Where should we look? 

One place to look for stringy effects is in the early universe. If the energy scale of 
the early universe reaches Mg, an abundance of physical effects will provide a more 
direct path to test our ideas than currently feasible in high energy experiments. 
In fact, attempts to interpret observations will tell us whether our present under- 
standing of physics and the universe is advanced enough to permit us to ask and 
answer sensible questions about these remote physical regimes. We shall focus on 
the inflationary universe scenario,’ ° which reaches energy scales sufficiently high 
to create ample remnants. The hot big bang’s nucleosynthetic epoch has direct 
observational support at and below MeV scales whereas the energies of interest to 
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us are ~ Mg. We are most fortunate that precision cosmology has begun to let 
us address questions relevant to these extreme conditions. Even in this rather spe- 
cialized topic of inflationary cosmology and string theory, a comprehensive review 
by Baumann and McAllister* appeared recently. So we shall take the opportunity 
here to present a brief introduction and share within this framework some of our 
own views on a particularly interesting topic, namely, cosmic strings. The subject 
of cosmic strings has been extensively studied too, so here we shall focus on low 
tension cosmic strings, which may appear naturally in string theory. 

Since current gravitational observations are insensitive to quantum effects, one 
may wonder why an ultraviolet completion of the gravity theory is needed if infla- 
tion, in fact, occurs several orders of magnitude below the Planck scale. To partly 
answer this, we like to recall that in 1920s, there was no obvious necessity for a 
quantum theory consistent with special relativity given that electrons in atoms and 
materials move at nonrelativistic speed only. However, the Dirac theory predicted 
anti-matter, the value of the electron’s gyromagnetic moment and a host of interest- 
ing properties (e.g. the Lamb shift) that are well within reach of the nonrelativistic 
study of electrons and atoms. The standard electroweak model, the ultraviolet com- 
pletion of the Fermi’s weak interaction model, is another example. Among other 
successes, the standard electroweak model predicted the presence of the neutral 
current as well as explains the 7° decay to two photons via anomaly and the basis 
of the quark—lepton family via the anomaly free constraints. All these phenomena 
are manifest at energy scales far below the electroweak scale. In the current con- 
text, we need to search for stringy effects that are unexpected in classical general 
relativity and might be observed in the near future at energy scales far below the 
Planck scale. 

There are many specific ways to realize inflation within string theory. These 
may be roughly grouped into models with small and large field ranges. They have 
rather different properties and predictions. Recall that 6 of the 9 spatial dimensions 
must be compactified into a Calabi-Yau like manifold with volume Vg, where 


2 
(FE) ~ M8Ve > 1. (1) 
During inflation the inflaton field ¢ moves from an initial to final position within 
the manifold, covering a path or field range Ad. Now, the distance scale inside 
the compactified manifold is bounded by Mp). The case Ad < Mp, is known as 
small field inflation. The opposite limit when the field range far exceeds the typical 
scale of the manifold, Ad >> Mp, is large field inflation. In the latter case the 
inflaton wanders inside the compactified manifold for a while during inflation. This 
scenario may occur if the inflaton is an axion-like field and executes a helical-like 
motion. Flux compactification of the six spatial dimensions in string theory typically 
introduces many axions and naturally sets the stage for this possibility. 

The recent B-mode polarization measurement in the cosmic microwave back- 
ground (CMB) radiation® by BICEP2 is most interesting. Primordial B-mode 


II-276 D. F. Chernoff and S.-H. H. Tye 


polarization is sourced by tensor perturbations during inflation and measurable 
primordial B-mode polarization implies that Ag >> Mp). This is a prototypical 
example of how it is now possible to probe high energies and early moments of the 
universe’s history by means of cosmological observations. However, since dust can 
contaminate today’s observed signal,” it remains to be seen whether the BICEP2 
observations definitively imply that large field inflation takes place. We believe this 
issue should be settled soon. In the meantime, we shall entertain both possibilities 
here. 

In both small and large field ranges macroscopic, one-dimensional objects, here- 
after cosmic strings, can appear rather naturally. In the former case, the cosmic 
strings can be the fundamental superstrings themselves, while in the latter case, 
they can be fundamental strings and/or vortices resulting from the Higgs like mech- 
anism. Cosmic strings with tension above the inflation scale will not be produced 
after inflation and we should expect only relatively low tension strings to be pro- 
duced. Because of the warped geometry and the presence of throats in the com- 
pactification in string theory, some cosmic strings can acquire very low tensions. 
In flux compactification such strings tend to have nontrivial tension spectrum (and 
maybe even with beads). 

If the cosmic string tension is close to the present observational bound (about 
Gu < 10~*), strings might contribute to the B-mode polarization at large @ mul- 
tipoles independent of whether or not BICEP2 has detected primordial B mode 
signals. The string contribution to the CMB power spectrum of temperature fluc- 
tuations is limited to be no more than ~ 10% and this constrains any string B-mode 
contribution. Because the primordial tensor perturbation from inflation decreases 
quickly for large multipoles one very informative possibility is that cosmic strings 
are detected at large ¢ and other astrophysical contributions are sub-dominant. 

On the other hand, if tension is low (Gu < 10~) then the cosmic string effect for 
all @ is negligible compared to known sources and foreground contributions. Other 
signatures must be sought. For small Gy string loops are long lived and the oldest 
(and smallest) tend to cluster in our galaxy, resulting in an enhancement of ~ 10° in 
the local string density. This enhancement opens up particularly promising avenues 
for detecting strings by microlensing of stars within the galaxy and by gravitational 
wave emission in the tension range 107!4 < Gu « 107". As a cosmic string passes 
in front of a star, its brightness typically doubles, as the two images cannot be 
resolved. As a string loop oscillates in front of a star, it generates a unique signature 
of repeated achromatic brightness doubling. Here, we briefly review the various 
observational bounds on cosmic strings and estimate the low tension cosmic string 
density within our galaxy as well as the likelihood of their detection in the upcoming 
observational searches. It is encouraging that searches of extrasolar planets and 
variables stars also offer a chance to detect the micro-lensing of stars by cosmic 
strings. Once a location is identified by such a detection, a search for gravitational 
wave signals should follow. 
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Detection of cosmic strings followed by the measurement of their possible dif- 
ferent tensions will go a long way in probing superstring theory. Although a single 
string tension can easily originate from standard field theory, a string tension spec- 
trum should be considered as a distinct signature of string theory. It is even possible 
that some cosmic strings will move in the compactified dimensions with warped 
geometry, which can show up observationally as strings with varying tension, both 
along their lengths as well as in time. In summary, cosmic strings probably offer 
the best chance of finding distinct observational support for the string theory. 


2. The Inflationary Universe 


So far, observational data agrees well with the simplest version of the slow-roll infla- 
tionary universe scenario, i.e. a single, almost homogeneous and isotropic, scalar 
inflaton field d = @(t) subject to potential V(d). For a Friedmann—Lemaitre— 
Robertson—Walker metric, general relativity yields schematic simple equations for 
the cosmic scale factor a(t) and for ¢, 


2 yD 2 
w= (2) =a [VO =| eit ets © 
$+3H¢ = - 5 = -V'(d), (3) 


where the Hubble parameter H = a/a (a(t) = da/dt) measures the rate of expansion 
of the universe, and p-(0), Pm(0) and p,(0) are the curvature, the nonrelativistic 
matter and the radiation densities at time t = 0.* In an expanding universe, a(t) 
grows and the curvature, the matter and the radiation terms diminish. The energy 
and pressure densities for the inflaton are 


pPo=—= +V(9), Pp==Z-VYO), (4) 


where a canonical kinetic term for ¢ is assumed. If V(@) is sufficiently flat and if 
é is initially small then Eq. (3) implies that ¢ moves slowly in the sense that its 
kinetic energy remains small compared to the potential energy, 62? < V(@). In the 
limit that V(@) is exactly constant and dominates in Eq. (2) then H is constant 
and a(t) « e”*. More precisely one may define the inflationary epoch as that period 


when the expansion of the universe is accelerating, i.e. d > 0, or, in a spatially flat 


“The energy density has been divided into pm and p;. The ultra-relativistic massive particles are 
lumped into p; and the marginally relativistic massive particles included in pm. This division is 
epoch-dependent and schematic. 
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universe, 


2MpH = —(o +P); 


<1. 
Slow-roll inflation means small ¢, H nearly constant and a(t) ~ a(0)e”*. 

In a typical slow-roll inflationary model inflation ends at t = teng when the 
inflaton encounters a steeper part of the potential and ¢ > 1. The number of e-folds 
of inflation is Ne ~ Htena, a key parameter. The energy released from the potential 
V(@) heats the universe at the end of inflation and starts the hot big bang. The 
period of slow-roll must last at least N. > 50 e-folds to explain three important 
observations about our universe that are otherwise unaccounted for in the normal 
big bang cosmology: flatness, lack of defects and homogeneity. For any reasonable 
initial curvature density p.(0), the final curvature density pc(tena) < pc(0)e °° 
will be totally negligible, thus yielding a flat universe. This is how inflation solves 
the flatness problem. Any defect density (probably included in p,,(0)) present in 
the universe before the inflationary epoch will also be inflated away, thus solving the 
so-called “monopole” or defect problem. Since the cosmic scale factor grows by a 
huge factor, the universe we inhabit today came from a tiny patch of the universe 
before inflation. Any original inhomogeneity will be inflated away. Inflation explains 
the high degree of homogeneity of the universe. In summary, if N. is sufficiently 
large then inflation accounts for the universe’s observed flatness, defect density and 
homogeneity. 

What is amazing is that inflation also automatically provides a mechanism to 
create primordial inhomogeneities that ultimately lead to structure formation in our 
universe. As the inflaton slowly rolls down the potential in the classical sense, quan- 
tum fluctuations yield slightly different ending times tenq and so slightly different 
densities in different regions. The scalar and the metric fluctuations may be treated 
perturbatively, and one obtains the dimensionless power spectra in terms of H, 


1 #? 
2 a 
ash) = 8n2 M2,|e|’ 
2 H? 
2 ee — pA2 
A2(k) = 2M rA2(k), (6) 
r= 16e, 


where AZ, and Aj, are the scalar and the tensor modes, respectively. A scalar scale- 
invariant power spectrum corresponds to constant A2, which occurs when a space 
expands in a nearly de-Sitter fashion for a finite length of time. The scalar mode is 
related to the temperature fluctuations first measured by COBE.? Measurements 
and modeling have been refined over the past two decades. Since @ rolls in the 
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nonflat potential in the inflationary scenario, Ae will have a slight k dependence. 
This is usually parametrized with respect to the pivot wave number k, in the form 


a ns—1+(dns/dink) In(k/kp)/2+--, 
) (7 


AB(k) = Abby) (F 

Pp 
k ne+(dnt/dlnk) In(k/kp)/2+--, 
i) 


A2(k) = A2 (kp) ( (8) 


where the PLANCK and WMAP best fit values for ACDM quoted in Ref. 10 for 
kp = 0.05 Mpc! are 


ns = 0.9603 + 0.0073, (9) 


ADRs) = 219 es x 10. (10) 


These do not change appreciably when a possible tensor component is included 
and PLANCK constraints on r are given at pivot 0.002Mpc7!. In the slow-roll 
approximation, where b is negligible in Eq. (3), the deviation from scale-invariance 
is quantified by the spectral tilt 


In A2 
ne—1= 5 7 a = —6¢ + 2n, 

: 7 (11) 

Ws N 24 2 

Tak 16en + 24e° + 2°, 
where the parameters that measure the deviation from flatness of the potential are 

M3, (V'\” 

= —_ 12 
c=F(S), (12) 
= Mav" 2 _ MgV'v" (13) 

4 V2 


where € and 7 are the known as the slow-roll parameters which take small values 
during the inflationary epoch. The tensor mode comes from the quantum fluctua- 
tion of the gravitational wave. Its corresponding tilt is 


Ne = —2e. (14) 


For a small field range Ad, the constraint on N, dictates a rather flat potential 
and small €, which in turn implies a very small r. This relation can be quantified 


by the Lyth bound," 
Ad r 
a SS =a 
Mp welt ne) 


If 60 > N. > 40, we find that typical values of r satisfies r < 0.005 for Ad < Mp). 
This is much smaller than r ~ 0.2 reported by BICEP2.® 
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While A? is a combination of H? and 1/e, Az, provides a direct measurement 
of H? and hence the magnitude of the inflaton potential during inflation: 


a [a 16 4 
Ve (=) (2.2 x 10!6 Gev)4. (16) 


The imprint of tensor fluctuations are present in the CMB but AZ, «< Az so 
it is not possible to measure A. directly from total temperature fluctuations at 
small ¢. Fortunately, the CMB is linearly polarized and can be separated into 
E-mode and B-mode polarizations. It happens that A}. contributes to both modes 
while A2 contributes only to the E-mode. So a measurement of the primordial 
B-mode polarized CMB radiation is a direct measurement of Az. 

Searching for the B-mode CMB is very important. Besides the intrinsic smallness 
of the B-mode signal, which makes detection a major challenge, the primordial 
fluctuation may be masked by the interstellar dust. It is likely that the uncertainty 
from dust would have been resolved by the time this paper appears. So we may 
consider this as a snapshot after the announcement of the BICEP2 data and before 
a full understanding of the impact of dust on the reported detection. Due to this 
uncertainty, we shall consider both a negligibly small r (say r < 0.002) and an 
observable primordial r < 0.2. 


3. String Theory and Inflation 


By now, string theory is a huge research subject. So far, we have not yet identified 
the corner where a specific string theory solution fits what happens in nature. It is 
controversial to state whether we are close to finding that solution or we are still 
way off. By emphasizing the cosmological epoch, we hope to avoid the details but 
try to find generic stringy features that may show up in cosmological observations. 
Here we give a lightning pictorial summary of some of the key features of Type 
IIB superstring theory so readers can have at least a sketchy picture of how string 
theory may be tested. 

Because of compactification, we expect modes such as Kaluza—Klein modes to 
appear in the effective four-dimensional theory. Their presence will alter the relation 
between the inflaton potential scale and r (16) to: 


kg oe r 16 4 
a5 (S) (2.2 x 10'6 Gev)s, (17) 


bSince tensor and scalar perturbations have different £ dependencies experimentalists deduce the 
scalar spectrum from high £ measurements where scalar power will dominate any tensor contri- 
bution. Then they observe the total temperature fluctations at low @ where tensors should make 
their largest impact. If the theoretical relation between low and high @ fluctuations is known 
(“no running” being the simplest possibility) then the total observed low ¢ power limits whatever 
extra contribution might arise from the tensor modes. The WMAP!” and Planck! collaborations 
(incorporating data from the South Pole Telescope (SPT) and Atacama Cosmology Telescope 
(ACT) microwave background experiments and baryon acoustic oscillation observations) placed 
limits on r, r < 0.13 and r < 0.11, respectively. These upper limits on r are somewhat less than 
the r values reported by BICEP2 and under investigation. !4 
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where N effectively counts the number of universally coupled (at one-loop) degrees 
of freedom below the energy scale of interest here.!° Present experimental bound 
925 


on N is very loose. Surprisingly, a value as big as N ~ 102° is not ruled out. 


3.1. String theory and flux compactification 


Here is a brief description on how all moduli are dynamically stabilized in flux 
compactification. Recall that 2-form electric and magnetic field strengths follow 
from the 1-form field A, in the electromagnetic theory, under which point-like 
particles (such as electrons) are charged. In analogy to this, fundamental strings 
in string theory are charged under a 2-form field B,, which yields 3-form field 
strengths. In general, other dimensional objects are also present in string theory, 
which are known as branes. A p-brane spans p spatial dimensions, so a membrane 
is a 2-brane while a 1-brane is string like; that is a 1-brane is really a string. In the 
Type IIB superstring theory, there are a special type of Dp-branes where p is an odd 
integer. Among other properties of Dp-branes, we like to mention two particularly 
relevant ones here : (1) each end of an open string must end on a Dp-brane, and 
(2) with the presence of D1-string (or -brane), there exists another 2-form field C,,, 
under which a D1-string is charged. 

Self-consistency of Type IIB superstring theory requires it to have nine spatial 
dimensions. Since only three of them describe our observable world, the other six 
must be compactified. Consider a stack of D3-branes of cosmological size. Such a 
stack appear as a point in the six compactified dimensions. In the brane world 
scenario, all standard model particles (electrons, quarks, gluons, photons, etc.) 
are light open string modes whose ends can move freely inside the D3-branes, 
but cannot move outside the branes, while the graviton, being a closed string 
mode, can move freely in the branes as well as outside, i.e. the bulk region of 
the six compactified dimensions. In this sense, the D3-branes span our observ- 
able universe. We can easily replace the D3-branes by D7-branes, with 4 of their 
dimensions wrapping a 4-cycle inside the compactified six-dimensional space. Dark 
matter may come from unknown particles inside the same stack, or from open 
string modes sitting in another stack sitting somewhere else in the compactified 
space. 

As a self-consistent theory, the six extra dimensions must be dynamically com- 
pactified (and stabilized). This is a highly nontrivial problem. Fortunately, the 
3-form field strengths of both the NS—NS field B,,, and the R-R field Ci, are 
quantized in string theory. Wrapping 3-cycles in the compactified dimensions, these 
fluxes contribute to the effective potential V in the low energy approximation. Their 
presence can render the shape and the size of the compactified six-dimensional 
space to be dynamically stabilized. This is known as flux compactification. Here 
the matter content and the forces of nature are dictated by the specific flux com- 
pactification.!®!7 Warped internal space appears naturally. This warped geometry 
will come to play an important role in cosmology. 
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To describe nature, a Calabi-Yau like manifold is expected, with branes and 
orientifold planes. At low energy, a particular manifold may be described by a 
set of dynamically stabilized scalar fields, or moduli. One or more Kahler moduli 
parameterize the volume while the shape is described by the complex structure 
moduli, whose number may reach hundreds. A typical flux compactification involves 
many moduli and three-form field strengths with quantized fluxes (see Ref. 18). 
With such a large set of dynamical ingredients, we expect many possible vacuum 
solutions; collectively, this is the string theory landscape, or the so-called cosmic 
landscape. Here, a modulus is a complex scalar field. Written in polar coordinate, 
we shall refer to the phase degree of freedom as an axion. 

For a given Calabi-Yau like manifold, we can, at least in principle, determine 
the four-dimensional low energy supergravity effective potential V for the vacua. 
To be specific, let us consider only 3-form field strengths F{ wrapping the three- 
cycles inside the manifold. (Note that these are dual to the four-form field strengths 
in four-dimensional spacetime.) We have V(Fi,¢;) > V(ni,¢;), (¢ = 1,2,...,N, 
j =1,2,...,K) where the flux quantization property of the 3-form field strengths 
F§ allow us to rewrite V as a function of the quantized values n; of the fluxes 
present and ¢; are the complex moduli describing the size and shape of the com- 
pactified manifold as well as the couplings. There are barriers between different sets 
of flux values. For example, there is a (finite height) barrier between n1 and n; —1, 
where tunneling between V(n1,n2,...,2n,¢;) and V(n; — 1,n2,...,nN,@;) may 
be achieved by brane-flux annihilation.!® For a given set of n;, we can locate the 
meta-stable (classically stable) vacuum solutions V(n;,¢;) by varying @;. We sift 
through these local minima which satisfy the following criteria: they have vanish- 
ingly small vacuum energies because that is what is observed in today’s universe, 
and long decay lifetimes to lower energy states because our universe is long-lived. 
These criteria restrict the manifolds, flux values and minima of interest; nonethe- 
less, within the rather crude approximation we are studying, there still remain 
many solutions; and statistically, it seems that a very small cosmological constant 
is preferred.?° 


3.2. Inflation in string theory 


When the universe was first created (say, a bubble created via tunneling from noth- 
ing), ¢; are typically not sitting at ¢;, min, So they tend to roll toward their stabilized 
values. Heavier moduli with steeper gradients probably reached their respective 
minima relatively quickly. The ones with less steep directions took longer. The 
last ones to reach their stabilized values typically would move along relatively flat 
directions, and they can play the role of inflatons. So the vacuum energy that 
drives inflation is roughly given by the potential when all moduli except the infla- 
ton have already reached bottom. Since the flux compactification in string theory 
introduces dozens or hundreds of moduli (each is a complex scalar field in the 
low energy effective theory approximation), one anticipates that there are many 
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candidates for inflation. Even if we assume that just one field is responsible for 
the inflationary epoch in our own universe it seems likely that string theory can 
choose among many possibilities to realize single field inflation in many different 
ways. 

In fact, the picture from string theory may offer additional interesting possibili- 
ties beyond field theory. For example, suppose there was a pair of brane—anti-brane 
present in the early universe whose tensions drive the inflation. As we shall see, 
the inflaton field happens to be the distance between them. After their annihilation 
that ends inflation, the inflaton field no longer exists as a degree of freedom in the 
low energy effective field theory. 

Some inflationary scenarios in string theory generate unobservably small pri- 
mordial r while others give observably large primordial r. Small field range implies 
small r, but large field range does not necessary imply large r, which happens when 
the Lyth bound (15) is not saturated. To obtain enough e-folds, we need a flat 
enough potential, so it is natural to consider an axion as the inflaton, as first pro- 
posed in natural inflation.?)?? The axion has a natural shift symmetry ¢ > ¢+ 
any constant. This continuous symmetry can easily be broken, via nonperturba- 
tive effects, to a discrete symmetry, resulting in a periodic potential that can drive 
inflation. The typical field range is sub-Planckian implying a small r. We shall see 
shortly that there are many string theory models built out of the axion that permit 
large field ranges and some of these can yield large r. 

Here, we shall describe the various string theory realizations of inflation 
within the framework of Type IIB string theory. The pictorially simplest mod- 
els occur in the brane world scenario. We shall present a few sample models to give 
a picture of the type of models that have been put forward. Readers can find a 
more complete list in Ref. 4. If r turns out to be large, it will be interesting to see 
whether some of the small r models can be adapted or modified to have a large 
r. For example, one can try to modify a brane inflationary scenario into a warm 
inflationary scenario with a relatively large r. 


4. Small r Scenarios 


Besides fundamental superstrings, Type IIB string theory has Dp-branes where 
the number of spatial dimensions p is odd. Furthermore, supersymmetry can be 
maintained if there are only D3- and D7-branes; so, unless specified otherwise, we 
shall restrict ourselves to this case. String theory has nine spatial dimensions. Since 
our observable universe has only three spatial dimensions, the other six spatial 
dimensions have to be compacted in a manifold with volume Vg. The resulting (i.e. 
dynamically derived) Mp; is related to the string scale Mg via Eq. (1) and the 
typical field range A¢@ is likely to be limited by Mp; > Mg. For such a small field 
range, the potential has to be flat enough to allow 50 or more e-folds and the Lyth 
bound (15) for 60 > N. > 40 implies r < 0.005. We shall refer to these models as 
small r scenarios. 
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4.1. Brane inflation 


The discovery of branes in string theory demonstrated that the theory encompasses 
a multiplicity of higher-dimensional, extended objects and not just strings. In the 
brane world scenario, our visible universe lies inside a stack of D3-branes, or a 
stack of D7-branes. Here, 6 of the 9 spatial dimensions are dynamically compact- 
ified while the three spatial dimensions of the D3-branes (or 3 of the D7-branes) 
are cosmologically large. The six small dimensions are stabilized via flux compact- 
ification!®1"; the region outside the branes is referred to as the bulk region. The 
presence of RR and NS-NS fluxes introduces intrinsic torsion and warped geometry, 
so there are regions in the bulk with warped throats (Fig. 1). Since each end of an 
open string must end on a brane, only closed strings are present in the bulk away 
from branes. 

There are numerous such solutions in string theory, some with a small positive 
vacuum energy (cosmological constant). Presumably the standard model particles 
are open string modes; they can live either on D7-branes wrapping a 4-cycle in the 
bulk or (anti-)D3-branes at the bottom of a warped throat (Fig. 1). The (relative) 
position of a brane is an open string mode. In brane inflation,?° one of these modes 
is identified as the inflaton ¢, while the inflaton potential is generated by the clas- 
sical exchange of closed string modes including the graviton. From the open string 
perspective, this exchange of a closed string mode can be viewed as a quantum loop 
effect of the open string modes. 


Na 


B A 


Fig. 1. A pictorial sketch of a generic flux compactified six-dimensional bulk, with a number of 
warped throats (4 of which are shown here). Besides the warped throats, there are D7-branes 
wrapping 4-cycles. The blue dots stand for mobile D3-branes while the red dots are D3-branes 
sitting at the bottoms of throats. In the D3-D3-brane inflationary scenario, the tension of the 
brane pairs provide the vacuum energy that drives inflation as the D3-brane moves down A- 
throat. Inflation ends as the D3-brane annihilates with the D3-brane in A-throat. The standard 
model branes may live in A-throat or S-throat. As an alternative, inflation may take place while 
branes are moving out of B-throat. (For color version, see page II-CP14.) 
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4.1.1. D3-D3-brane inflation 


In the early universe, besides all the branes that are present today, there is an extra 
pair of D3-D3-branes.2*?° Due to the attractive forces present, the D3-brane is 
expected to sit at the bottom of a throat. Here again, inflation takes place as the D3- 
brane moves down the throat toward the D3-brane, with their separation distance 
as the inflaton, and inflation ends when they collide and annihilate each other, 
allowing the universe to settle down to the string vacuum state that describes our 
universe today. Although the original version encounters some fine-tuning problems, 
the scenario becomes substantially better as we make it more realistic with the 
introduction of warped geometry.° 

Because of the warped geometry, a consequence of flux compactification, a mass 
M in the bulk becomes haM at the bottom of a warped throat, where ha < 1 
is the warped factor (Fig. 1). This warped geometry tends to flatten, by orders of 
magnitude, the inflaton potential V(¢), so the attractive D3-D3-brane potential 
is rendered exponentially weak in the warped throat. The attractive gravitational 
(plus RR) potential together with the brane tensions takes the form 


4 
V(¢) = 2Tshig (1 = ma): (18) 
where T3 is the D3-brane tension and the effective tension is warped to a very small 
value T3h4,. The warp factor h depends on the details of the throat. Crudely, 
h(¢) ~ 6/@edge, where ¢ = dedge When the D3-brane is at the edge of the throat, so 
h(dedge) X 1. At the bottom of the throat, where ¢ = 4 X edge, ha = h(a) = 
A/ edge. The potential is further warped because N4 > 1 is the D3 charge of the 
throat. The D3-D3-brane pair annihilates at ¢ = ¢4. In terms of the potential (18), 
a tachyon appears as ¢ — ¢y so inflation ends as in hybrid inflation. The energy 
released by the brane pair annihilation heats up the universe to start the hot big 
bang. To fit data, h4 ~ 10~?. If the last 60 e-folds of inflation takes place inside 
the throat, then @cage > @ = ba during this period of inflation. This simple model 
yields n, ~ 0.97, r < 10~° and vanishing non-Gaussianity. 

The above model is very simple and well motivated. There are a number of inter- 
esting variations that one may consider. Within the inflaton potential, we can add 
new features to it. In general, we may expect an additional term in V(¢) of the form 

BH? ¢? 
a? 
where H is the Hubble parameter so this interaction term behaves like a conformal 


coupling. Such a term can emerge in a number of different ways: 


e Contributions from the Kahler potential and various interactions in the superpo- 
tential?® as well as possible D-terms,?” so G may probe the structure of the flux 
compactification.2%:79 

e It can come from the finite temperature effect. Recall that finite temperature T 


induces a term of the form T?¢? in finite temperature field theory. In a de-Sitter 


II-286 D. F. Chernoff and S.-H. H. Tye 


universe, there is a Hawking—Gibbons temperature of order H thus inducing a 
term of the form H?¢?. 

e The D3-brane is attracted to the D3-brane because of the RR charge and the 
gravitational force. Since the compactified manifold has no boundary, the total 
RR charge inside must be zero, and so is the gravitational “charge.” A term of 
the above form appears if we introduce a smooth background “charge” to cancel 
the gravitational “charges” of the branes.°° On the other hand, negative tension 
is introduced via orientifold planes in the brane world scenario. If the throat is 
far enough away from the orientifold planes, it is reasonable to ignore this effect. 


Overall, 3 is a free parameter and so is naively expected to be of order unity, 
GB ~ 1. However, the above potential yields enough inflation only if @ is small 
enough, 3 < 1/5.3! The present PLANCK data implies that ( is essentially zero or 
even slightly negative. 

Another variation is to notice that the six-dimensional throat can be quite non- 
trivial. In particular, if one treats the throat geometry as a Klebanov—Strassler 
deformed conifold, gauge-gravity duality leads to the expectation that ¢ will 
encounter steps as the D3-brane moves down the throat.?? Such steps, though 
small, may be observed in the CMB power spectrum. 


4.1.2. Inflection point inflation 


Since the six-dimensional throat has one radial and five angular modes, V(¢) gets 
corrections that can have angular dependencies. As a result, its motion toward 
the bottom of the throat may follow a nontrivial path. One can easily imagine a 
situation where it will pass over an inflection point. Around the inflection point, 
V(@) may take a generic simple form 


BY C¢° 

ae 
where 7 = 0 at the inflection point B + 2C¢ = 0. As shown in Refs. 29 and 33, 
given that V(@) is flat only around the inflection point, Ne ~ 1/\/e, so € must 
be very small, resulting in a very small r. Here n, — 1 = 27 so we expect ns to 
be very close to unity, with a slight red or even blue tilt. Although motivated in 
brane inflation, an inflection point may be encountered in other scenarios of the 
inflationary universe, so it should not be considered as a stringy feature. 


V(¢) ~Yt+ Aes 


4.1.3. DBI model 


Instead of modifying the potential, string theory suggests that the kinetic term for 
a D-brane should take the Dirac-Born—Infeld form?*;3° 


1 1 1 
50 Pon > —-2- 4/1— f(¢)0"b0,¢ + Fe’ 
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where f(¢) ~ h(¢)~* ~ A/¢* is the warp factor of a throat. This DBI property is 
intrinsically a stringy feature. Here, the warp factor plays the role of a brake that 
slows down the motion of the D-brane as it moves down the throat, 


f(d)(019)* <1. 


This braking mechanism is insensitive to the form of V(¢), so many e-folds are 
assured as ¢ — 0. It produces a negligibly small r but a large non-Gaussianity in 
the equilateral bi-spectrum as the sound speed becomes very small. The CMB data 
has an upper bound on the non-Gaussianity that clearly rules out the dominance 
of this stringy DBI effect. Nevertheless, it is a clear example that stringy features 
of inflationary scenarios may be tested directly by cosmological observation. 

Instead of moving down a throat, one may also consider inflation while a brane 
is moving out of a throat.?637 In this case, the predictions are not too different 
from that of the inflection point case. 


4.1.4. D3-D7-brane inflation 


Here, the inflaton is the position of a D3-brane moving relative to the position of 
a higher-dimensional D7-brane.** Since the presence of both D3 and D7-branes 
preserves supersymmetry (as opposed to the presence of D5-branes), inflation can 
be driven by a D-term and a potential of the form like 


V(b) =Vo talnd — bd? + cd* +--- 


may be generated. Such a potential yields n, ~ 0.98, which may be a bit too big. 
One may lower the value a little by considering variations of the scenario. In any 
case, one ends up with a very small r. 


4.2. Kahler moduli inflation 


In flux compactification, the Kahler moduli are typically lighter than the complex 
structure moduli. Intuitively, this implies their effective potentials are flatter than 
those for the complex structure moduli. For a Swiss-cheese like compactification, 
besides the modulus for the overall volume, we can also have moduli describing the 
sizes of the holes inside the manifold. So it is reasonable to find situations where a 
Kahler modulus plays the role of the inflaton. A potential is typically generated by 
nonperturbative effects, so examples may take the form*® 


V ~VW(1— Ae **), (20) 


where both A and k are positive and of order unity or bigger. Here, 7 ~ —Ak?e7*? < 
0 and € ~ 7?/2k?, so the potential is very flat for large enough ¢. If the inflaton 
measures the volume of a blow-up mode corresponding to the size of a 4-cycle in a 
Swiss-Cheese compactification in the large volume scenario, we have k ~ V/V nV, 
where the compactification volume is of order VY ~ 10° in string units, so k is huge 
and r © 2(n, — 1)?/k? ~ 1071. Models of this type are not close to saturating 
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the Lyth bound (15). Other scenarios*® of this type again have small r. It will be 
interesting whether this type of models can be modified to have a larger r. 


5. Large r Scenarios 


The Lyth bound (15) implies that large r requires large inflaton field range Ad >> 
Mp. A phenomenological model with relatively flat potential and large field range 
is not too hard to write down, e.g. chaotic inflation. However, the range of a typical 
modulus in string theory is limited by the size of compactification, A@ < Mp). Even 
if we could extend the field range (e.g. by considering an irregular shaped manifold), 
the corrections to a generic potential may grow large as @ explores a large range. 
Essentially, we lose control of the approximate description of the potential. Axions 
allow one to maintain control of the approximation used while exploring large 
ranges. Here we shall briefly review two ideas, namely the Kim—Nilles—Peloso Mech- 
anism*! and the axion monodromy.****% In both cases, an axion moves in a helical- 
like path. The flatness of the potential and the large field range appear naturally. 


5.1. The Kim—Nilles—Peloso mechanism 


Let us start with a simple model and then build up to a model that is relatively 
satisfactory. 


5.1.1. Natural inflation 


It was noticed long ago that axion fields may be ideal inflaton candidates because 
an axion field ¢, a pseudo-scalar mode, has a shift symmetry, 6 — 6+ const. This 
symmetry is broken to a discrete symmetry by some nonperturbative effect so that 


a periodic potential is generated,?!:?? 
o) A m? 
veo) =a cos ($) ope prea ¢’, (21) 


where we have set the minima of the potential at ¢ = 0 to zero vacuum energy. 
With suitable choice of A and f the resulting axion potential can be relatively 
small and flat, an important property for inflation. As f becomes large, this model 
approaches the (quadratic form) chaotic inflation,*4 which is well studied. To have 
enough e-folds, we may need a large field range, say, Ad > 14 Mp, which is possible 
here only if the decay constant f > Ad >> Mp). This requires a certain degree of 
fine tuning since a typical f is expected to satisfy f < Mp) (see, for example, Fig. 2 
in Ref. 45). To fit the scalar mode perturbations of COBE m ~ 7 x 10~° Mp). The 
range of predictions*® in the r versus ns plot is shown in Fig. 2. 


5.1.2. N-flation 


There are ways to get around this to generate enough e-folds. One example is to 
extend the model to include N different axions (similar to the idea of using many 
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Fig. 2. The tensor-to-scalar ratio r versus the primordial tilt ns plot for pivot scale 0.002 Mpc™!. 


Pairs of solid and dashed semi-circular lines encompass 68% and 95% confidence limits for r and ns 
as given in Ref. 10. The regions are based on PLANCK data combined with: WMAP [large scale 
polarization] (gray), WMAP plus BAO [baryon acoustic oscillations in galaxy surveys] (blue) 
and WMAP plus higher € CMB data [ACT and SPT] (red) as described in detail in Ref. 10. 
All remaining lines, points and arrows describe theoretical models. Convex potentials lie above 
the straight, solid black line; concave potentials below it. Power law inflation generates results 
along the straight, dashed line. String-related models include the short yellow segment for a linear 
inflaton potential V ~ ¢, the short red segment for V ~ ¢2/3 and the short black segment (chaotic 
inflation) for V ~ ¢?. Each small (large) dot stands for 50 (60) e-folds. Two purple lines moving 
to the left show the full range of predictions for natural inflation. All small field models and 
some large field ones suppress r (these are not plotted). The black arrow is the vertical shift A, 
illustrating how the results for cosine potential differ from those of the ¢? potential. (For color 
version, see page II-CP14.) 


scalar fields*®-*”) each with a term of the form in Eq. (21), with a different decay 
constant f; < Mp). This N-flation model*® is a string theory inspired scenario since 
flux compactification results in the presence of many axions. In the approximation 
that the axions are independent of each other (that is, their couplings with each 
other are negligible), the analysis is quite straightforward and interesting.*? The 
predictions are similar to that of a large field model, that is, 0.93 < n, < 0.95 
but r < 107%. In general, the axions do couple to each other and the situation can 
be quite complicated. For a particularly simple case, inspired by string theory and 
supergravity, a statistical analysis has been carried out and clear predictions can 
be made. For a large number of axions, one has 


V(¢i) = Vo + So ai cos (4) + S85 cos (# a 4). (22) 
; fi . i& 
The statistical distribution of results yields small values for r quite similar to that 
of the N-flation model. 
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5.1.3. Helical inflation 


Let us consider a particularly interesting case of a 2-axion model of the form*! 


vin nrtetaftnen(@)+afi-oa(-2)]} es 


where we take f{ < fi < Mp). 
Note that the second term in V (23), namely V2, vanishes at 


Ot og os 
fi fe 

This is the minimum of V2 and the bottom of the trough of the potential V. For 
large enough A, inflaton will follow the trough as it rolls along its path. Now alte Fi 
measures the number of cycles that @2 can travel for 0 < ¢; < f,. Although ¢2 has a 
shift symmetry $2 — ¢2+27f2, the path does not return to the same configuration 
after ¢2 has traveled for one period because ¢, has also moved. Thus, instead of a 
shift symmetry, the system has a helical symmetry. Moving along the path of the 
trough (24), we see that the second term in the potential (23) vanishes and so the 
potential V reduces to that with only the first term and the range of the inflaton 
field @2 can easily be super-Planckian. 

To properly normalize the fields, one defines two normalized orthogonal 
directions, 


_ Sidi + fade y — 2e1= fide 
the inflaton will roll along the X-direction while Y-direction is a heavy mode which 
can be integrated out. Note that the system is insensitive to the magnitude of A as 
long as it is greater than O(1) such that Y-direction is heavy enough. For instance, 
the slow roll parameters € and 7 are not affected which means that the observables 
ns and r are insensitive to A. Since the X path is already at the minimum of the 
second term, at Y = 0, the effective potential along X is determined by Vj, 


X cos 6 1VyxX? 
V(X) = vo {1 cos ( — )} a 7 
fi 


So this helical model is reduced to the single cosine model (21) where 


= (4) f+ a~ oe 
can be bigger than Mp, even if all the individual f; < Mp). 

So it is not difficult to come up with stringy models that can fit the existing 
data. In view of the large B-mode reported by BICEP2,® this model was revisited by 
a number of groups.°° °” In fact, one can consider a more general form for Vi(¢1),°° 
which can lead to a large field range model with somewhat different predictions. 


(25) 


cos” 0, (26) 


cos @ = (27) 
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Since the shift symmetry of an axion is typically broken down to some discrete 
symmetry, the above cosine model is quite natural. The generalization to more than 
two axions is straightforward, and one can pile additional helical motions on top 
of this one, increasing the value of the effective decay constant f by additional big 
factors. All models in this class reduce to a model with a single cosine potential, 
which in turn resembles the quadratic version of chaotic inflation. This limiting 
behavior is quite natural in a large class of axionic models in the supergravity 
framework.°? Because of the periodic nature of an axionic potential, this cosine 
model can have a smaller value of r than chaotic inflation.44 The deviations from 
chaotic inflation occur in a well-defined fashion. Let 


— [2 
A=16A=r+4(n,—1) = ar 


where ¢? chaotic inflation has A = 0 while the cosine model has A < 0. All other 
quantities such as runnings of spectral indices have very simple dependencies on 
A. As shown in Table 1, this deviation is quite distinctive of the periodic nature of 
the inflaton potential. For n, = 0.96, r = 0.16 in the ¢? model, while r can be as 
small as r = 0.04 (or A= —0.12) in the cosine model. As data improves, a negative 
value of A can provide a distinctive signature for a periodic axionic potential for 
inflation. 


(28) 


5.2. Axion monodromy 


The above idea of extending the effective axion range can be carried out with 
a single axion, where the axion itself executes a helical motion. This is the axion 


Table 1. Comparison between ¢? chaotic inflation and the cosine model for the various 
physically measurable quantities®! where A (28) measures the difference. Here Mp = 1. 


V(¢) 3m? ¢? Vo[1 — cos(#)] 

e= a(V/V)? ir rid 
n=V"/V gr jer +24 
eavivny? 0 1A 

ea yiayoys 0 wprAt+rh? 

Ns — 1 = 2n —6e —4r —zrt+4A 

nt = —2e —3r -—gr 

ite = —16en + 24e? + 2€? wr pr —rA 

ak = —den + Be? at at =3r4 
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monodromy model.4?:43 In this scenario, inflation can persist through many periods 
around the configuration space, thus generating an effectively large field range with 
an observable r. 

Recall that a gauge field, or one-form field (i.e. with one spacetime index), is 
sourced by charged point-like fields, while a 2-form (anti-symmetric tensor) field 
is sourced by strings. So we see that there will be at least one 2-form field in 
string theory. Consider a 5-brane that fills our four-dimensional spacetime and 
wraps a 2-cycle inside the compactified manifold. The axion is the integral of a 2- 
form field over the 2-cycle. Integrating over this 2-cycle, the six-dimensional brane 
action produces a potential for the axion field in the resulting four-dimensional 
effective theory. Here, the presence of the brane breaks the axion shift symmetry 
and generates a monodromy for the axion. For NS5-branes, a typical form of the 
axion potential (coming from the DBI action) is 


V(¢) = AV/82 + #2. (29) 


For small parameter b, V(¢) ~ Ad. The prediction of such a linear potential is shown 
in Fig. 2. One can consider D5-branes instead. D7-branes wrapping 4-cycles in the 
compactified manifold is another possibility. For large values of ¢, a good axion 
monodromy model requires that there is no uncontrollable higher-order stringy or 
quantum corrections that would spoil the above interesting properties. ©* Variants 
of this picture may allow a more general form of the potential, say V(¢) ~ ¢? where 
p can take values such as p = 2/3, 4/3, 2,3, thus generating r ~ 0.04, 0.09, 0.13 and 
0.2, respectively. The p = 2 case predicts a n, value closest to r ~ 0.16 reported by 
BICEP2 after accounting for dust. The observational bounds are currently under 
scrutiny. Readers are referred to Ref. 4 for more details. 


5.3. Discussions 


If BICEP 2’s detection of r is confirmed, it does not necessarily invalidate completely 
all the small r models discussed above. It may be possible to modify some of them 
to generate a large enough r. As an example, if one is willing to embed the D3-D3- 
brane inflation into a warm inflationary model, a r ~ 0.2 may be viable.®* It will 
be interesting to re-examine all the small r string theory models and see whether 
and how any of them may be modified to produce a large r. 

As string theory has numerous solutions, it is not surprising that there are 
multiple ways to realize the inflationary universe scenario. With cosmological data 
available today, theoretical predictions and contact with observations are so far 
quite limited. As a consequence, it is rather difficult to distinguish many string 
theory inspired predictions from those coming from ordinary field theory (or even 
supergravity models). There are exceptions, as pointed out earlier. For example, the 
DBI inflation prediction of an equilateral bi-spectrum in the non-Gaussianity, or 
the determined spacing of steps in the power spectrum itself, may be considered to 
be distinct enough that if either one is observed, some of us may be convinced that 
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it is a smoking gun of string theory. Although searches for these phenomena should 
and would continue, so far, we have not been lucky enough to see any hint of them. 

Here we like to emphasize that there is another plausible signature to search 
for. Since all fundamental objects are made of superstrings (we include D1-strings 
here), and the universe is reheated to produce a hot big bang after inflation, it is 
likely that, besides strings in their lowest modes which appear as ordinary particles, 
some relatively long strings will also be produced, either via the Kibble mechanism 
or some other mechanism. They will appear as cosmic strings. 

When there are many axions, or axion-like fields, with a variety of plausible 
potentials, the possibilities may be quite numerous and so predictions may be 
somewhat imprecise. For any axion or would-be-Goldstone bosons, with a con- 
tinuous U(1) symmetry, we expect a string-like defect which can end up as cosmic 
strings if generated in early universe. In field theory, a vortex simply follows from 
the Higgs mechanism where the axion a appears as the phase of a complex scalar 
field, ® = pe**/f. In string theory, we note that such an axion a is dual to a 2-form 
tensor field Cy, i.e. Ma = €#”*"A,Cy,,. As pointed out earlier, such a 2-form field 
is sourced by a string. So the presence of axions would easily lead to cosmic strings 
(i.e. vortices, fundamental strings and D1-strings) and these may provide signatures 
of string theory scenarios for the inflationary universe. This is especially relevant 
if cosmic strings come in a variety of types with different tensions and maybe even 
with junctions. 

Following Fig. 1, we see that besides the axions responsible for inflation, there 
may be other axions. Some of them may have mass scales warped to very small 
values. If a potential of the form (21) is generated, a closed string loop becomes 
the boundary of a domain wall, or membrane. It will be interesting to study the 
effect of the membrane on the evolution of a cosmic string loop. The tension of the 
membrane and the axion mass are of order 


on VAf, m= 


It is interesting to entertain the possibility that this axion can contribute substan- 
tially to the dark matter of the universe. If so, its contribution to the energy density 
is roughly given by A while its mass is estimated to be m ~ 10~?? eV.™ This yields 
Aw 10-8Ms,, fe VA/m ~ 10-!°Mp) and o © 10-8 M3, ~ 10-14 GeV3. For 
such a small membrane tension, the evolution of the corresponding cosmic string is 
probably not much changed. 


6. Relics: Low Tension Cosmic Strings 


Witten’s® original consideration of macroscopic cosmic strings was highly influen- 
tial. He argued that the fundamental strings in heterotic string theory had tensions 
too large to be consistent with the isotropy of the COBE observations. Had they 
been produced they would be inflated away. It is not even clear how inflation might 
be realized within the heterotic string theory. 
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However, with the discovery of D-branes,®° the introduction of warped geome- 
tries®’ and the development of specific, string-based inflationary scenarios, the pic- 
ture has changed substantially. Open fundamental strings must end on branes; so 
both open and closed strings are present in the brane world. Closed string loops 
inside a brane will break up into pieces of open strings, so only vortices (which may 
be only metastable) can survive inside branes. Here, D1-strings (i.e. D1-branes) 
may be treated as vortices inside branes and survive long enough to be cosmologi- 
cally interesting.©° 

The warped geometry will gravitationally redshift the string tensions to low 
values and, consequently, strings can be produced after inflation. A string with 
tension T in the bulk will be warped to p = h?T with h being the warp factor, 
which can be very small when the string is sitting at the bottom of a throat. (It 
is h = ha in Eq. (18) in throat A.) That is important because relics produced 
during inflation are rapidly diluted by expansion. Only those generated after (or 
very near the end of) inflation are potentially found within the visible universe. 
Since the Type IIB model has neither D0O-branes nor D2-branes, the well-justified 
conclusion that our universe is dominated neither by monopoles nor by domain 
walls follows automatically. Scenarios that incorporate string-like relics may prove 
to be consistent with all observations. Such relics appear to be natural outcomes of 
today’s best understood string theory scenarios. 

The physical details of the strings in Type IIB model can be quite nontrivial, 
including the types of different species present and the range of string tensions. 
Away from the branes, p fundamental Fl-strings and q D1-strings can form a 
(p,q) bound state. Junctions of strings will be present automatically. If they live 
at the bottom of a throat, there can be beads at the junctions as well. To be 
specific, let us consider the case of the Klebanov-Strassler warped throat,°® whose 
properties are relatively well understood. On the gravity side, this is a warped 
deformed conifold. Inside the throat, the geometry is a shrinking S? fibered over 
a $°. The tensions of the bound state of p Fl-strings and that of q D1-strings 
were individually computed.”? The tension formula for the (p,q) bound states is 
given by Ref. 71 in terms of the warp factor h, the strings scale Mg and the string 


coupling gs, 
h2M2 |g (bM\* . o/xp 
Toa = Qqr g2 | ( T ) oe Gar em) 


where b = 0.93 numerically and M is the number of fractional D3-branes (that is, 
the units of 3-form RR flux F3 through the S$?). Interestingly, the F'1-strings are 
charged in Z,y and are non-BPS. The D-string on the other hand is charged in 


Z and is BPS with respect to each other. Because p is Zjy-charged with nonzero 
binding energy, binding can take place even if (p,q) are not coprime. Since it is a 
convex function, i.e. Th4p < Tp + Tp’, the p-string will not decay into strings with 
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smaller p. M fundamental strings can terminate to a point-like bead with mass” 


5 (bM\?? h 
Myeaa = 4/5 (=) ~Mszs 
T T 3 


irrespective of the number of D-strings around. Inside a D-brane, F1-strings break 
into pieces of open strings so we are left with D1-strings only, in which case they 
resemble the usual vortices in field theory. For M — oo and b = h = 1, the tension 
(30) reduces to that for flat internal space.’? Cosmological properties of the beads 
have also been studied.” 

Following from gauge-gravity duality, the interpretation of these strings in the 
gauge theory dual is known. The F’'l1-string is dual to a confining string while the D1- 
string is dual to an axionic string. Here the gauge theory is strongly interacting and 
the bead plays the role of a “baryon”. It is likely that different throats have different 
tension spectra similar to that for the Klebanov—Strassler throat. Finally, there is 
some evidence that strings can move in both internal and external dimensions and 
are not necessarily confined to the tips of the throat.” This behavior can show up 
as a cosmic string with variable tension. Following gauge-gravity duality, one may 
argue that one can also obtain these types of strings within a strongly interacting 
gauge theory; however, so far we are unable to see how inflation can emerge from 
such a nonperturbative gauge theory description. 

As we will describe in more detail, all indications suggest that, once pro- 
duced after inflation, a scaling cosmic string network will emerge for the stable 
strings. Before string theory’s application to cosmology, the typical cosmic string 
tension was presumed to be set by the grand unified theory’s (GUTs) energy 
scale. Such strings have been ruled out by observations. We will review the cur- 
rent limits shortly. Warped geometry typically allows strings with tensions that 
are substantially smaller, avoiding the observational constraints on the one hand 
and frustrating easy detection on the other. The universe’s expansion inevitably 
generates sub-horizon string loops and if the tension is small enough the loops will 
survive so long that their peculiar motions are damped and they will cluster in 
the manner of cold dark matter. This results in a huge enhancement of cosmic 
string loops within our galaxy (about 10° times larger at the Sun’s position than 
the mean throughout the universe) and makes detection of the local population a 
realistic experimental goal in the near future. We will focus on the path to detec- 
tion by means of microlensing. Elsewhere, we will discuss blind, gravitational wave 
searches. 

A microlensing detection will be very distinctive. The nature of a microlensing 
loop can be further confirmed and studied through its unique gravitational wave 
signature involving emission of multiple harmonics of the fundamental loop period 
from the precise microlensing direction. Since these loops are essentially the same 
type of string that makes up all forms of microscopic matter in the universe, their 
detection will be of fundamental importance in our understanding of nature. 
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6.1. Strings in brane world cosmology 


String-like defects or fundamental strings are expected whenever reheating pro- 
duces some closed strings toward the end of an inflationary epoch. Once inflation 
ends and the radiation dominated epoch begins, ever larger sections of this cosmic 
string network re-enter the horizon. The strings move at relativistic speeds and 
long lengths collide and break off sub-horizon loops. Loops shrink and evaporate 
by emitting gravitational waves in a characteristic time 7 = 1/(['Gy) where / is the 
invariant loop size, yw is the string tension while [ is numerically determined and 
I ~ 50 for strings coupled only to gravity.° To ease discussion, we shall adopt this 
value for T. 

String tension is the primary parameter that controls the cosmic string net- 
work evolution, first explored in the context of phase transitions in grand uni- 
fied field theories (GUTs), which may be tied to the string scale. Assuming that 
the inflation scale is comparable to the GUT scale, inflation generated horizon- 
crossing defects whose tension is set by the characteristic grand unification energy.° 
These GUT strings with Gu ~ 107° would have seeded the density fluctuations 
for galaxies and clusters but have long been ruled out by observations of the 
CMB.275:76 

Here, string theory comes to the rescue. Six of the string theory’s 9 spatial 
dimensions are stably compactified. The flux compactification involves manifolds 
possessing warped throat-like structures which redshift all characteristic energy 
scales compared to those in the bulk space. In this context, cosmic strings pro- 
duced after inflation living in or near the bottoms of the throats can have different 
small tensions.”” ®§! The quantum theory of one-dimensional objects includes a host 
of effectively one-dimensional objects collectively referred to here as superstrings. 
For example, a single D3-brane has a U(1) symmetry that is expected to be broken, 
thus generating a string-like defect. (For a stack of n branes, the U(n) D U(1) sym- 
metry is generic.) Note that fundamental superstring loops can exist only away from 
branes. Any superstring we observe will have tension 4 exponentially diminished 
from that of the Planck scale by virtue of its location at the bottom of the throat. 
Values like Gu < 107 (i.e. energies < 101? GeV) are entirely possible. A typical 
manifold will have many throats and we expect a distribution of 4, presumably 
with some Gp > 107". 

In addition to their reduced tensions, superstrings should differ from standard 
field theory strings (i.e. vortices) in other important ways: long-lived excited states 
with junctions and beads may exist, multiple noninteracting species of superstrings 
may coexist and, finally, the probability for breaking and rejoining colliding seg- 
ments (intercommutation) can be much smaller than unity.3? Furthermore a closed 
string loop may move inside the compacted volume as well. Because of the warped 
geometry there, such motion may be observed as a variable string tension both 
along the string length and in time. 
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6.2. Current bounds on string tension Gu and probability 
of intercommutation p 


Empirical upper bounds on Gy have been derived from null results for experi- 
ments involving lensing,®? 9! gravitational wave background and bursts,?? ' pul- 
sar timing®4:106111 and CMB radiation.?:1079,76.112°124 We will briefly review some 
recent results but see Refs. 125 and 126 for more comprehensive treatments. 

All bounds on string tension depend upon uncertain aspects of string physics 
and of network modeling. The most important factors include: 


e The range of loop sizes generated by network evolution. “Large” means compa- 
rable to the Hubble scale, “small” can be as small as the core width of the string. 
Large loops take longer to evaporate by emission of gravitational radiation. 

e The probability of intercommutation p is the probability that two crossing strings 
break and reconnect to form new continuous segments. Field theory strings have 
p ~ 1 but superstrings may have p as small as 10~?, depending on the crossing 
angle and the relative speed. The effect of lowering p is to increase the network 
density of strings to maintain scaling. 

e The physical structure of the strings. F1 strings are one-dimensional, obeying 
Nambu-—Goto equations of motion. D1 strings and field theory strings are vortices 
with finite cores. 

e The mathematical description of string dynamics used in simulations and cal- 
culations. The Abelian Higgs model is the simplest vortex description but the 
core size in calculations is not set to realistic physical values. Abelian Higgs and 
Nambu-Goto descriptions yield different string dynamics on small scales. 

e The number of stable string species. Superstrings have more possibilities than 
simple field theory strings. These include bound states of F1 and D1 strings with 
beads at the junctions and noninteracting strings from different warped throats. 

e The charges and/or fluxes carried by the string. Many effectively one-dimensional 
objects in string theory can experience nongravitational interactions. 

e The character of the discontinuities on a typical loop. The number of cusps and/or 
the number of kinks governs the emitted gravitational wave spectrum. 


While there has been tremendous progress, all of these areas are under active 
study. 

When the model-related theoretical factors are fixed each astrophysical experi- 
ment probes a subset of the string content of spacetime. For example, CMB power 
spectrum fits rely on well-established gross properties of large-scale string networks 
which are relatively secure but do not probe small sized loops which may domi- 
nate the total energy density and which would show up only at large @. Analysis of 
combined PLANCK, WMAP, SPT and ACT data!?* implies Gu < 1.3 x 1077 
for Nambu-Goto strings and Gu < 3.0 x 107" for field theory strings. Limits 
from optical lensing in fields of background galaxies rely on the theoretically well- 
understood deficit angle geometry of a string in spacetime but require a precise 
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accounting for observational selection effects. Analysis of the GOODS and COS- 
MOS optical surveys?®-?! yields Gu < 3 x 1077. Taken together these observations 
imply GuS1-3 x 1077. 

There is a well-established bound on the gravitational energy density at the 
time of Big Bang Nucleosynthesis because an altered expansion rate impacts light 
element yields (e.g. see Ref. 127). The gravitational radiation generated by any 
string network cannot exceed the bounds. If a network forms large Nambu—Goto 
loops of one type of string with intercommutation probability p an estimate of the 
BBN constraint is Gu <5 x 10~"p? (Ref. 128) (this depends implicitly on the loop 
formation size and a still-emerging understanding of how network densities vary 
with p). LIGO’s experimental bound on the stochastic background gravitational 
radiation! from a similar network implies a limit on Gy of the same general form 
as the BBN limit but weaker. Advanced LIGO is projected to reach Gu ~ 107!” 
for p = 1.18 The same LIGO results can be used to place a reliable, conservative 
bound of Gu < 2.6 x 10-4 over a much wider range of possible models, almost 
independent of loop size and of frequency scaling of the emission.!*° 

More stringent bounds rely on additional assumptions. Consider a specific set 
of choices for the secondary parameters of strings: one string species, Nambu—Goto 
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Fig. 3. Observational constraints limit string tension and intercommutation probability if the 
network forms large loops. Each line is a particular observational constraint. The region below and 
to the right of a line is disfavored. Smaller p increases the number density of loops and larger Gu 
increases gravitational signal amplitude. Conversely, the region to the left and above a line remains 
consistent with observational limits. The illustrated constraints are: Big Bang Nucleosynthesis 
(orange line) and CMB (yellow line) from the gravitational wave spectra in Ref. 128, pulsar time 
of arrival for the Parkes Pulsar Timing Array (blue line, based on the analytic form in Ref. 109) 
and for NANOGrav (red dotted line) in Ref. 111. All lines depend upon theoretical modeling of 
the network, especially the fraction of large loops formed. For example, the initials BP (green) 19? 
are an example in which this fraction has been inferred from simulations and the bound on Gy 
at p = 1 indicated (the line has not been calculated). This result is approximately ~30 times less 
restrictive on Gy than an equivalent analysis in which all loops are born at a single large size as 
has been assumed in all the other analyses. (For color version, see page II-CP14.) 
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dynamics, only gravitational interactions, large loops (a = 0.1), each with a cusp. 
The time of arrival of pulses emitted by a pulsar vary on account of the gravitational 
wave background. When the sequence is observed to be regular the perturbing 
amplitude of the waves is limited. The current limit is Gu<107° for p = 1 and 
<$10-! for p = 1073.1! Figure 3 is a graphical summary that illustrates some of 
the bounds discussed. The BBN (orange line) and CMB (yellow line) constraints 
are shown as a function of string tension Gy and intercommutation probability p. 
The Parkes Pulsar Timing Array limit (blue line)!°° and the NANOGrav limit (red 
dotted line)!!! are based on radiating cusp models. Each constraint rules out the 
area below and to the right of a line. Most analyses do not account for the fact that 
only a small fraction of horizon-crossing string actually form large loops which are 
ultimately responsible for variation in arrival times. In this sense the lines may be 
over-optimistic (see figure caption). 

Superconducting strings have also been proposed.!2° The bound on supercon- 
ducting cosmic strings is about Gu <10~1°.'%° Interestingly, it was pointed out 
that the recently observed fast radio bursts can be consistent with being produced 
by superconducting cosmic strings.!%! 

In short, cosmic superstrings are generically produced toward the end of inflation 
and observations imply tensions substantially less than the original GUT-inspired 
strings. Multiple, overlapping approaches are needed to minimize physical uncer- 
tainties and model-dependent aspects. There is no known theoretical impediment 
to the magnitude of Gy being either comparable to or much lower than the current 
observational upper limits. 


7. Scaling, Slowing, Clustering and Evaporating (see Ref. 133) 


Simulated cosmological string networks converge to self-similar scaling solu- 
tions.>8?:1!4 Consequently Qiong (Qtoop), the fraction of the critical density con- 
tributed by horizon-crossing strings (loops), is independent of time while the 
characteristic size of a loop formed at time ¢t scales with the size of the horizon: 
1 = at for some fixed a. To achieve scaling, long strings that enter the horizon 
must be chopped into loops sufficiently rapidly — if not, the density of long strings 
increases and over-closes the universe. In addition, loops must be removed so that 
Qicop stabilizes — if not, loops would come to dominate the contribution of normal 
matter. Scaling of the string network is an attractor solution in many well-studied 
models and the universe escapes the jaws of both Scylla and Charybdis. The inter- 
commutation probability determines the efficiency of chopping and yz determines 
the rate of loop evaporation. 

Studies of GUT strings? took Gu ~ 10~® (large enough to generate pertur- 
bations of interest at matter-radiation equality) and a small (set by early esti- 
mates of gravitational wave damping on the long strings) with the consequence 
Hr ~ a/TGu « 1 where H is the Hubble constant. Newly formed GUT loops 
decay quickly. The superstrings of interest here have smaller Gy so that loops of 
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a given size live longer. In addition, recent simulations!** 138 produce a range of 
large loops: 107+ Sa $0.25. The best current understanding is that ~ 10-20% of 
the long string length that is cut up goes into loops comparable to the scale of the 
horizon (a ~ 0.1) while the remaining ~ 80-90% fragments to much smaller size 
scales.'39:'40 The newly formed, large loops are the most important contribution 
for determining today’s loop population. 

Cosmic expansion strongly damps the initial relativistic center of mass motions 
of the loops and promotes clustering of the loops as matter perturbations 
grow.4!:142 Clustering was irrelevant for the GUT-inspired loops. They moved 
rapidly at birth, damped briefly by cosmic expansion and were re-accelerated to 
mildly relativistic velocities by the momentum recoil of anisotropic gravitational 
wave emission (the rocket effect) before fully evaporating.9?:!43:!44 GUT loops were 
homogeneously distributed throughout space. By contrast, below a critical tension 
Gy ~ 107° all superstring loops accrete along with the cold dark matter.!4? 

Loops of size 1 = lg = ['Gyto are just now evaporating where to is the age 
of the universe. The mean number density of such loops is dominated by network 
fragmentation when the universe was most dense, i.e. at early times. When they 
were born they came from the large end of the size spectrum, i.e. a substantial 
fraction of the scale of the horizon. The epoch of birth is t; = lg/a = [Gyuto/a. 
For Gu < 7 x 10~°(a/0.1)(50/T) loops are born before equipartition in ACDM, i.e. 
ti < teq. The smallest loops today have ly ~ 40pce(Gy/2 x 10~'°) with characteristic 
mass scale M, = 1.7 x 10°Mo(Gp/2 x 10-1)? for T = 50. 

Loop number and energy densities today are dominated by the scale of the 
gravitational cutoff, the smallest loops that have not yet evaporated. The charac- 
teristic number density dn/dlog! « (TG)~3/?(ateq/to)'/?/t3 and the energy den- 
sity dptoop/dlog! ~ TGutodn/dlogl. The latter implies Qioops « \/aGp/T whereas 
Qiong «x PG. In scenarios with small py it’s the loops that dominate long, horizon- 
crossing strings in various observable contexts. The probability and rate of local 
lensing are proportional to the energy density. 

Loops are accreted as the galaxy forms. Figure 4 illustrates schematically the 
constraints for a loop with typical initial peculiar velocity to be captured during 
galaxy formation and to remain bound today. The loop must lie within the inner 
triangular region which delimits small enough tension and early enough time of 
formation. Above the horizontal line capture is impossible; below the diagonal line 
detachment by the rocket effect has already occurred. The critical tension for loop 
clustering in the galaxy is set by the right-hand corner of the allowed region. 

To briefly summarize: F, the enhancement of the galactic loop number density 
over the homogeneous mean, simply traces €, the enhancement of cold dark matter 
over Qpm fc, for critical density p.. This encapsulates conclusions of a study of 
the growth of a galactic matter perturbation and the simultaneous capture and 
escape of network-generated loops.!4? At a typical galactocentric distance of 11 kpc 
logig F = logig E+ f(y) where log, € = 5.5 and f(y) = —0.337—0.064y?—g(y) and 
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Fig. 4. Bounds on formation time and string tension for a loop with initial velocity v; = 0.1 to 
be captured at physical radius 30kpc by galaxy formation. (1) Upper bounds on the formation 
time t;/to are given by the horizontal lines. The condition that cosmic drag lower the velocity 
to less than the circular rotation velocity today is given by the red line. The more stringent 
condition that capture occur at 30kpc is given by the turquoise line. (2) Upper bounds on the 
string tension Gy are given by the diagonal lines. The condition that the loop be younger than 
its gravitational wave decay timescale is given by the red line. The more stringent condition that 
the loop not be accelerated out of the galaxy by today is given by the green line. (3) The shaded 
region encompasses string tensions and formation times giving bound loops at 30 kpc for v; = 0.1 
and a = 0.1. The critical value of Gu below which clustering is possible is determined by the 
upper right hand corner of the green and turquoise lines. Lowering v; raises the limit on t;/to 
(horizontal lines moves upward); lowering a shifts the bound to smaller Gy (diagonal lines moves 
leftward). Shifting the loop orbital scale to smaller values (say the solar position) requires earlier 
formation times (horizontal lines shifts down) and allows larger Gy (green line shifts to the right 
but is limited by the red line which is fixed). (4) The geometric symbols illustrate the sensitivity 
of the rocket effect. These are numerical experiments examining the outcome today (t = to) for 
groups of 10-20 loops captured at 30 kpc with slightly different string tensions: stars = all loops 
bound, boxes = all loops ejected, triangles = some bound and some ejected (for clarity the points 
are slightly offset in the vertical but not the horizontal direction). (For color version, see page 
II-CP15.) 


g(y) = 5(1 + tanh(y — 7)). The tension-dependent deviation from loops as passive 
tracers of cold dark matter is f(y), y = log;g(Gu/10~'), fit for 0 < y < 5 from the 
capture study. Clustering saturates for small tensions: f(y) = f(0) for y < 0. The 
over-bar indicates quantities averaged over the spherical volume and, in the case of 
the loops, a weighting by loop length in the capture study. 

The extra piece g(y) plays a role for Gu > 10~'° where it describes the sup- 
pression in clustering as one approaches the upper right corner of the triangle in 
Fig. 4. Numerical simulations have not yet accurately determined it. 

A single f(y) fits a range of radii as long as € > 1 (typically, galactic distances 
less than ~ 100 kpc). So we can easily have an enhancement F ~ 10° for Gu ~ 107! 
at the Sun’s position. By comparison, GUT loops would have F = 1 for all positions 
and tensions. 
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The model accounts for the local population of strings and we have used it to 
estimate microlensing rates, and LISA-like, LIGO-like and NANOGrav-like burst 
rates. 


7.1. Large-scale string distribution 


We will start with a “baseline” description (loops from a network of a single, grav- 
itationally interacting, Nambu-—Goto string species with reconnection probability 
p = 1). The detailed description!*? was motivated by analytic arguments!8*° that 
roughly 80% of the network invariant length was chopped into strings with very 
small loop size (a ~ Gz) and by the numerical result!** 18 that the remaining 20% 
formed large, long-lived loops (a = 0.1). At a given epoch loops are created with 
a range of sizes but only the “large” ones are of interest for the local population. 
The baseline description is supposed to be directly comparable to numerical simu- 
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lations which generally take p = 1. The most recent simulations 
consistent. 

Next, we parameterize the actual “homogeneous” distribution in the universe 
when string theory introduces a multiplicity of string species and the reconnection 
probability p may be less than 1. And finally we will form the “local” distribution 
which accounts for the clustering of the homogeneous distribution. 

In a physical volume V with a network of long, horizon-crossing strings of ten- 
sion 2 with persistence length L there are V/L° segments of length L. The physical 
energy density is poo = pL/V = p/L?. The persistence length evolves as the uni- 
verse expands. A scaling solution demands L « ¢ during power law phases. There 
are also loops within the horizon; their energy is not included in p, and the aim 
of the model is to infer the number density of loops of a given invariant size. 

Kibble!“ developed a model for the network evolution for the long strings and 
loops in cosmology. It accounted for the stretching of strings and collisional inter- 
commutation (long string segments that break off and form loops; loops that recon- 
nect to long string segments). A variety of models of differing degrees of realism 
have been studied since then, guided by ever more realistic numerical simulations 
of the network. As a simple approximate description we focus on the Velocity One 
Scale model!*6 
the network turns out to be a rather small effect and is ignored. We extended exist- 
ing treatments by numerically evaluating the total loop creation rate in flat ACDM 
cosmology. The loop energy in a comoving volume varies like E, = Cp..pva? /L 
where C’ is the chopping efficiency, p is the intercommutation probability and v is 
the string velocity. All quantities on the right-hand side except p vary in time; C 
is a parameterized fit in matter and radiation eras. 

By integrating the model from large redshift to the current epoch one evaluates 
the fraction of the network that is lost to loop formation. The rate at which the 
loop energy changes is E; = Aja®/(p?t®) where JA is a slowly varying function of 
redshift z and p shown in Fig. 5. The plotted variation of A with redshift indicates 


in a recently elaborated form.'47!48 The reattachment of loops to 
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Fig. 5. The redshift-dependent rate of energy loss to loops experienced by the string network 
in ACDM cosmology for a given intercommutation probability p. This is an important input for 
predicting properties of today’s loops. The ordinate is the dimensionless measure of the rate of 
energy loss to loops, A = E)/ (ja? /(p?t?)). If the scale factor a(t) were a power law in time then A 
would be constant with respect to z. This is not the case in the ACDM cosmology with transitions 
from radiation-to-matter-to-A dominated regimes. Different lines show results for p = 1 (top) to 
p = 10~% (bottom) in powers of 10. If network density varied as 1/p? then all these lines would 
overlap. This is approximately true for small p but not for p > 0.1. 


departures from a pure scaling solution (consequences of the radiation to matter 
transition for a(t) and the implicit variation of chopping efficiency C). Knowing A 
as a function of redshift and p is the input needed to evaluate of the loop formation 
rate. Loops form with a range of sizes at each epoch. Let us call the size at the time 
of formation at. A very important feature predicted by theoretical analyses is that 
a large fraction of the invariant length goes into loops with a ~ Gy or smaller. The 
limiting behavior of numerical simulations as larger and larger spacetime volumes 
are modeled suggests ~ 80% of the chopped up long string bears that fate. Such 
loops evaporate rapidly without contributing to the long-lived local loop population. 
The remaining 20% goes into large loops with a ~ 0.1. Let us call the fraction of 
large loops f = 0.2. The birth rate density for loops born with size at, is 


dn\ fA 
(=) = apne aty). (31) 


A loop formed at t, with length J, shrinks by gravitational wave emission. Its size is 
(=, —TGp(t — ty) (32) 


at time t ([ ~ 50). 
The number density of loops of size | at time t is the integral of the birth rate 
density over loops of all length created in the past. For | < at or, equivalently, 
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ty < t we have 


dn _ (fA? (alts) ‘ 03 
Teor). | marae 83) 
_ '4+TGpt 
ty = =e (34) 
_ [Gu 
o=14—4 (35) 


The scaling dn/dl x (1 +TGyt)~* was already noted by Kibble.1*° 

The loop number distribution peaks at zero length but the quantity of interest 
in lensing is number weighted by loop length, ldn/dl. The characteristic scale at 
time t is 1, = [Gut, i.e. roughly the size of a newly born loop that would evaporate 
in total time t. The distribution [dn/dl peaks at 1 = (2/3)l,. 

For Gu < 7 x 10~*(a/0.1)(50/L) the loops near 1, today were born before 
equipartition, teq. We use a x t!/? to simplify the expression to give 


dn _ x fA —3/2 Oteg ne 
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where to is today and x = 1/I,. 
The numerical results are A ~ 7.68 for p= 1, f = 0.2, a= 0.1 and [ = 50 (and 
from ACDM teqg = 4.7 x 104 yr and to = 4.25 x 101’s). These give 


dn -6 Z —3/2 -3 
l a. - = L15s¢ 10 G+a)72h-38 kpe 5 (37) 
I, = 0.0206 p13 pe, (38) 
M, = 0.043 243 Mo. 39 
g 3 


Here, 13 = Gu/c?/10~'8 is an abbreviation for the dimensionless string tension 
in units of 10713. The baseline distribution lies below!47!48 on account of loss of a 
significant fraction of the network invariant length to small strings and of adoption 
the ACDM cosmology. 

Next, string theory modifications to the baseline are lumped into a common 


factor G 
dn dn 
on, = als 4 
( dl Yee @ ( dl i ») 


to give the description of the actual homogeneous loop distribution. Prominent 
among expected modifications is the intercommutation factor p. Large scale string 
simulations have not reached a full understanding of the impact of p < 1 though 
there is no question that G increases as a result. The numerical treatment of the 
Velocity One Scale model implies that A is a weak function of p for small p and, 
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in that limit, dn/dl « 1/p?. Ultimately, this matter will be fully settled via net- 
work simulations with p < 1. String theory calculations of the intercommutation 
probability suggests p = 10-'— 107-3 implying G = 10?—10°. 

The number of populated, noninteracting throats that contain other types of 
superstrings is a known unknown and is unexplored. In our opinion, there could 
easily be 100’s of such throats for the complicated bulk spaces of interest. 

In the string theory scenarios G = 1 is a lower limit. For the purposes of numer- 
ical estimates in this paper we adopt G = 10? (with a given tension) as the most 
reasonable lower limit. Much larger G are not improbable while lower G are unlikely. 
We should emphasize that strings in different throats would have different tensions, 
so adopting a single tension here yields only a crude estimate. 


7.2. Local string distribution 


Ifa loop is formed at time t with length / = at then its evaporation time r = 1/TGu. 
For Hubble constant H at ¢ the dimensionless combination Ht = a/(TGy) is a 
measure of lifetime in terms of the universe’s age. Superstring loops with a = 0.1 
and small yz live many characteristic Hubble times. 

New loops are born with relativistic velocity. The peculiar center of mass motion 
is damped by the universe’s expansion. A detailed study of the competing effects 
(formation time, damping, evaporation, efficacy of anisotropic emission of gravita- 
tional radiation) in the context of a simple formation model for the galaxy shows 
that loops accrete when p is small. The degree of loop clustering relative to dark 
matter clustering is a function of and approximately independent of J. Smaller jz 
means older, more slowly moving loops and hence more clustering. Below we give 
a simple fit to the numerical simulations to quantify this effect. 

The spatially dependent dark matter enhancement in the galaxy is 


e(r) = PoMlr) (41) 
OQpMPe 

where ppm(r) is the local galactic dark matter density and Qpmp- is average dark 
matter density in the universe. Low tension string loops track the dark matter with 
a certain efficiency as the dark matter forms gravitationally bound structures.'4? 
We fit the numerical results by writing the spatially dependent string enhancement 
to the homogeneous distribution as equal to the dark matter enhancement times 
an efficiency factor 


F(r) = E(r)107), (42) 
—0.337 — 0.064y? — 5(1+tanh(y—7)) for 0 < y, 

f(y) = (43) 
—0.337 for y < 0, 

y = 2+ logio H-13 (44) 


and j1_13 is the dimensionless tension in units of 10~1°. The clustering is never 100% 
effective because the string loops eventually evaporate. The efficiency saturates at 
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y = 0 or Gu/c? = 10-. The local string population is enhanced by the factor F 
with respect to the homogeneous distribution 


To complete the description of the local loop population we adopt empirical 
fits to the galaxy’s dark matter halo.!49 Model I is a “cored galaxy” center in 
which the central, limiting dark matter density is zero. Since winds from stars and 
ejection/heating by supernovae lift baryons out of star-forming regions, the center, 
they reduce the gravitational potential and tend to lower the dark matter density. 
Model IT is a “cusped galaxy” center in which the density formally diverges. Many 
N-body simulations show that collisionless structure formation in ACDM yields such 
profiles. The two models are thought to bracket the range of physical possibilities 
in the central regions and are in general agreement on large scales. 

Let us define the effective number density of loops as the energy density in the 
distribution of loops of all size, pioop, divided by energy of a loop of size equal to 
the characteristic gravitational cutoff 


n = Piso (46) 
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Fig. 6. The effective number density of string loops in two galactic models for a range of string 
tensions. The dotted red lines show results for a galactic model with a cusp at the center, the solid 
blue lines for a model with a core. The tension is Gu/c? = 10~! for the uppermost curve and 
increases by powers of 10 until reaching 10~® for the lowest curve. The effective number density 
depends upon the clustering of loops, the dark matter distribution within the galaxy, the network 
scaling and loop distribution and G the enhancement of string theory strings compared to field 
theory strings. (For color version, see page II-CP15.) 
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Figure 6 plots the effective number density of string loops for two descriptions of the 
galaxy’s dark matter distribution (core and cusp) as a function of galactocentric 
radius. The different lines show a range of string tensions, all for G = 10?. The 
quantity 7 depends upon the efficacy of loop clustering, the dark matter distribution 
within our galaxy and the network scaling model. There are two curves plotted for 
each tension. These agree at large radii but differ near the galactic center where 
the detailed form of the dark matter distribution is uncertain. 

For many experiments the rate of detection scales as n weighted by powers of ly. 
The microlensing rate, for example, is proportional to the product of two factors: the 
number of string loops along a given line of sight « 7 and the length of an individual 
string loop « l,. The event rate scales like nl, x 7G. At the Sun’s position, the 
effective number density increase by ~ 15 orders of magnitude as Gu/c? drops from 
10-8 to 10-1°. The microlensing event rate increases by ~ 8 orders of magnitude. 
As we will discuss shortly, for the same variation in tension the timescale for an 
individual microlensing event decreases by ~ 8 orders of magnitude. This creates a 
huge range of timescales of interest for the duration and rate of microlensing events. 


8. Detection 
8.1. Detection via Microlensing 


Consider a straight segment of string oriented perpendicular to the observer’s line 
of sight with respect to a background source as shown in Fig. 7. Let two photons 
from the source travel toward the string. The photons do not suffer any relative 
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Fig. 7. (Left) The conical spacetime geometry near a short segment of string (red line) permits 
photons to travel two paths (black arrows) from source (yellow ball) to observer when source, 
string and observer are nearly aligned. The ideal observer perceives two images of the source split 
by a small angle (black arc) proportional in size to string tension. For low string tension, the 
observer cannot resolve the separate images; however, the flux may be easily measured to be twice 
that of a single image (in the point source limit) at each wavelength (negligible Kaiser—Stebbins 
effect). (Right) The microlensing amplification of flux for a point source as a function of time for 
a string (green) is compared to a Newtonian point masses (blue; the ratio of impact parameter to 
Einstein radius is 0.31). No other known astrophysical sources produce this digital microlensing. 
(For color version, see page II-CP15.) 
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deflection during the fly-by as long as they pass around the string in the same 
sense. However, there is a small angular region about the string with two paths 
from the source to the observer. Background sources within angle Og = 8"Gy = 
1.04 x 10~3(Gy/2 x 10~1°) arcsec form two images. Unlike the case of a point mass, 
shear and distortion are absent. The angular size of a sun-like star at distance R is 
Oo/On = 4.5 x 10-4(2 x 107 1°/Gy) (10 kpc/R) so galactic stellar sources generally 
appear point-like for Gu=107'. 

Compact halo objects lens background sources and unresolved, lensed 
sources will appear to fluctuate achromatically in brightness. This is microlensing. 


Experimental efforts to detect microlensing phenomena have borne considerable 
t,152 


150,151 


frui 

Likewise, loops of superstring microlens background sources but have a special 
property: the source brightness varies by a factor of 2 as the angular region associ- 
ated with the string passes across the observer-source line of sight. The amplitude 
variation, schematically compared in Fig. 7, is quite distinctive for point sources. 
The internal motions of a loop are relativistic and generally dominate the motion of 
the source and the observer. Numerical calculations have established that microlens- 
ing occurs when light passes near a relativistic, oscillating string loop!°*; the effect 
is not limited to a stationary string. 

The total rate of lensing Rz implied for a distribution of loops dn/dl is pro- 
portional to the solid angle swept out per time cl/W3r? for loop J at distance r, or 
Ry, = fdldrr?dn/dl(cl/\V/3r?). Small tensions give large lensing rates, Ry, «x 1//Gu 
because the integral over dn/dl « I~?° is dominated by | ~ I,, the gravitational 
cutoff. 

Figure 8 illustrates the hierarchy of relevant timescales. The characteristic dura- 
tion of the event is dt = ROg/c ~ 630s(R/10kpc)(Gu/2 x 10~1°). Loops bound to 
the galaxy have center-of-mass motions v, ~ 220km s~!; microlensing of a given 
source will repeat ~ c/v;, times; new sources are lensed at rate ~ (up/c)Rr. The 
characteristic loop oscillation timescale which governs the intervals in repetitive 
microlensing of a single source is tose ~ lg/¢ = 135 yr (Gu/2 x 10719). 

The unique fingerprint of loop lensing is repeated achromatic flux doubling (dig- 
ital microlensing). Detection efficiency for digital lensing depends upon the string 
tension, the magnitude and angular size of the background stars and the time 
sampling of the observations. Estimates can be made for any experiment which 
repeatedly measures the flux of stellar sources. 

The first string microlensing search!** was recently completed using photometric 
data from space-based missions CoRoT!*** and RXTE.!°°:4 The methodology was 
potentially capable of detecting strings with tensions 1071 < Gu < 107" though 
the expected number of detections was limited by the available lines of sight studied 


©The 2006 mission was developed and operated by the CNES, with the contribution of Austria, 
Belgium, Brazil, ESA (RSSD and Science Program), Germany and Spain. 
The 1995 mission was developed and operated by NASA. 


Inflation, string theory and cosmic strings TI-309 


Flux 


-— 


Time 


Fig. 8. A Schematic Pattern. Digital microlensing doubles the flux over the time period that 
source, loop and observer are aligned within the deficit angle created by the string (red line). The 
repetition interval for lensing a particular source is the loop oscillation timescale (green line) and 
~ 10° repetitions in total occur (yellow line). The timescale for a new source to be lensed by the 
original loop (or the original source by a new loop) is much longer (black line). (For color version, 
see page II-CP16.) 


in the course of the missions. In principle, any photometry experiment that makes 
repeated flux measurements of an intrinsically stable astronomical source has the 
power to limit a combination of the number density of loops and string tension. One 
interesting possibility is the satellite GAIA launched by the European Space Agency 
in December 2013 designed for astrometry. It is monitoring each of about 1 billion 
stars about 70 times over a period of 5 years. Another is the Large Synoptic Survey 
Telescope (LSST) to be sited in Chile within the decade to photograph the entire 
observable sky every few days. Previous estimates for the rate of detections for 
GAIA"! and LSST” were encouraging. Now, detailed calculations for WFIRST 
are also available. 


8.2. WFIRST microlensing rates (see Ref. 158) 


The Wide-Field Infrared Survey Telescope® (WFIRST) is a NASA space observa- 
tory designed to perform wide-field imaging and slitless spectroscopic surveys of 
the near infrared sky. WFIRST will carry out a microlensing survey program for 
exo-planet detection in the direction of the galactic bulge by observing a total of 
2.8 square degrees for 1.2 years primarily in a broad long wavelength filter (W- 
band 0.927—2.0 um). Repeated, short exposures (~1 minute) of the same fields are 
the key to searching for and to monitoring the amplification of bulge sources by 
star—planet systems along the line of sight. Fortuitously, the WFIRST experiment 


°http://wfirst.gsfc.nasa.gov/. 
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is also sensitive to cosmic strings. We have evaluated the expected microlensing 
rate by cosmic strings for a realistic distribution of stars (stellar types, distances, 
velocities, etc.), dust obscuration and survey parameters (flux sensitivity, time of 
exposure and angular scale of stars). The lensing rate is split into digital events (the 
flux doubles), analog events (all potentially measurable flux enhancements given the 
signal-to-noise of the observations) and total events (all geometric configurations 
that can lens, whether detectable or not) evaluated for two galactic dark matter 
models (with cusp and with core at galactic center). Figure 9 shows the string lens- 
ing rate per square degree as a function of string tension Gy/c” for G = 107. The 
lensing rate for digital and analog events exceeds 10 per square degree per year for 
10-4 < Gy/c? < 101° for the cusp model and for 107!8 < Gu/c? < 107! for 
the core model. The results for the cusp and core models should be regarded as 
establishing the range of astrophysical possibilities for the loop density distribution 
within the experiment. The range of string theory possibilities (the variation of 
G) remains substantial. Digital and spatially resolved analog events can be reliably 
detected. Blending of sources in analog events needs to be simulated to fully char- 
acterize the fraction of such events that will be detectable. In any case, the decrease 
from blending is limited: sources brighter than magnitude 23 in the J-band (1.131— 
1.454 um) contribute at least 10% of the total analog rate, are well-separated and 
produce light curves detectable at high signal-to-noise. 

These results show that WFIRST microlensing searches can probe hitherto 
unexplored ranges of string tension. Future surveys of wider areas of the bulge 
and/or surveys lasting for longer periods of time can potentially scale up the total 
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Fig. 9. The intrinsic string lensing rate (events per sq degree per year) for WFIRST’s study of 
the bulge for G = 10. The three solid lines show digital (red), analog (green) and total (blue) 


event rates for the dark matter model with a cusp. The three dashed lines (same meaning for the 
colors) give the event rate for the model with a core. The analog rate does not account for blending 
which may diminish the detectable rate by no more than a factor of 10. (For color version, see 
page II-CP16.) 
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expected rate of detection by as much as 100. Surveys with shorter exposures times 
are able to discern larger numbers of digital events. 


8.3. Gravitational waves 


Besides uncovering a relic from the earliest moments of the universe’s formation, 
the observation of a superstring microlensing event provides some immediate infor- 
mation. If the event is resolved in time, the characteristic string tension is inferred. 
If repetitions are seen, the characteristic loop size is constrained. And any detection 
provides a precise sky location for follow up. Some classes of strings create electro- 
magnetic signatures but all generate gravitational radiation possibly observable by 
LISA-like (hereafter, LISA) and LIGO-like instruments. 

The scenario that LISA detects the distinctive signature of local loop emission 
has been contemplated for a blind, all-sky search.'°? They relied on the enhance- 
ment from local clustering!*! and concluded that strings with 10-'9 < Gu < 107" 
were potentially detectable via fundamental and low-order harmonics. They also 
estimated the background from the galaxy and from the universe as a whole. 

Therefore, it is no surprise that a loop microlensing event offers some excit- 
ing possibilities. The key considerations are: (1) The precise location afforded by 
microlensing dramatically reduces the signal search space on the sky. (2) A generic 
loop radiates a full range of harmonics with frequency f, = nf, = 2cn/l because 
the equations of motion are intrinsically nonlinear. (3) Many string harmonics fall 
near LISA’s peak sensitivity nx = fe/fi >> 1 (huisa ~ 107738 at f, ~ 10723 Hz 
for 1 year periodic variation’; tension pcrit = 2.9 x 107! corresponds to a 1 year 
period; n, = 1.6 x 10°(u/{erit)). (4) Galactic binary interference becomes problem- 
atic only at f < fwp where fwp ~ 107?-° Hz.'61 

LISA’s verification binaries have known positions on the sky, orbital frequen- 
cies, masses, distance limits etc. determined by optical measurements and other 
means.!©° A long-lived superstring loop detected by microlensing shares similar 
observational advantages: it has known position on the sky, distance upper limit 
estimated based on the microlensed star and emits a distinctive signal. This cre- 
ates the ideal observational situation in which much of the data analysis can be 
conducted with a single fast Fourier transform. 

Different types of loops generate different gravitational wave signatures. The 
solution for a Nambu-—Goto loop’s motion is the sum of left- and right-moving one- 
dimensional waves subject to nonlinear constraints. When both modes are smooth 
(no derivative discontinuities) the dynamical solution can form a cusp. At that 
instant a bit of the string reaches the speed of light. When one mode is smooth 
but the other has a discontinuity the loop’s dynamical solution includes a kink, a 
discontinuity in the string’s tangent vector that moves along the string at the speed 
of light in one direction. When both modes have discontinuities we call the solution 
a k-kink. The presence of cusps, kinks and k-kinks may be inferred from the paths 
of the tangent vectors of the left and right moving modes which are constrained to 
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lie on the surface of a sphere. These distinctions are relevant for gravitational wave 
emission: the cusp generates bursts of beamed radiation, the kink has a pulsar-like 
beam that generically sweeps across a substantial angle in the sky and the k-kink 
emits into a substantial fraction of the sky. 

The tendency for string loops to explore part of the internal space of a typical 
compactification during oscillations™ alters the cusp and kink beaming patterns 
with potentially important repercussions for rates of detection in gravitational wave 
experiments. 16? 164 

We model the beaming anisotropy, the harmonic power and phase of the gravita- 
tional wave signal (qualitatively and quantitatively extending previous analyses!°?). 
Standard signal detection theory characterizes the strength of detection and the 
precision with which measurements can be made by comparing the putative signal 
to noise sources. Here, we consider only detector noise described as an additive, 
stationary, random Gaussian process with one-sided spectral noise at frequency f 
is S(||f||). The strength of signal A(t) is psyp = \/2(h,h) where the symmetric 


inner product is (g,h) = nee ap HOR and g(f) is the Fourier transform of g(t) 


(the factor of 2 stems from one-sided noise versus two-sided signal). Figure 10 sum- 
marizes pgnr for detecting a single source with beamed cusp, beamed kink and 
generic, unbeamed emission. Upper and lower line pairs show the effect of ignor- 
ing and accounting for the galactic white dwarf interference. Confusion is minimal 
because the string signal extends to frequencies where the binary density is small 
and because the signal overlap between string and binary sources is small. 
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Fig. 10. The log of LISA’s signal strength (log;9 psnr) for 1 year of observation of cusp, kink 
and k-kink (top to bottom) at a distance of 1 kpc with a known position on the sky as a function 
of string tension. The orientation of the cusp and kink is for the maximum signal. Upper/lower 
line pairs show the effect of ignoring/accounting for white dwarf interference. For tensions with 
Gu/c? > 3 x 10~!? the fundamental loop period exceeds the length of the observation so excess 
power is seen but the frequency resolution is insufficient to measure individual harmonics. 
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The time-averaged microlensing rate is roughly proportional to the loop’s max- 
imum projected area divided by its period of oscillation but the gravitational wave 
emissivity is dominated by specific loop configurations having rapidly moving seg- 
ments when the loop is most contracted. Microlensing and beamed, gravitational 
wave emission generally will not and need not be simultaneous to be informative. 
We elaborate on this important point. 

Generic cusps and kinks have characteristic beaming patterns described above 
and undergo fully three-dimensional motions that yield time-averaged microlensing 
rates that are not particularly angle-dependent. Microlensing is a favorable means 
to locate such loops but the high frequency, directed gravitational wave beams they 
emit are visible only from special directions. A microlensing detection does not sig- 
nificantly influence the probability of intercepting beamed emission in experiments 
with duration exceeding the fundamental loop period. Instead, there is a roughly 
constant, low probability that a cusp that is present will beam in the observer’s 
direction and modest probability that a kink will (the beam’s angular size is smaller 
at higher frequencies of emission). 

The most likely configuration for k-kinks is a flat, degenerate box-like orbit with 
a well-defined plane. Special observers who lie in that plane will not see microlensing 
but most others are sensitive to the loop’s projected motion and see the effect if 
a suitable background source exists. The gravitational wave emission of galactic 
k-kinks is not strongly beamed but, as the signal estimates show, is still potentially 
detectable. Unbeamed emission from loops that happen to contain cusps and kinks 
should also be detectable whether or not the high frequency beam impinges on the 
observer. 

To summarize, there are excellent chances to detect some of this emission once 
the precise location of the source is known. If the fundamental string period is 
less than the duration of the experiment then the string’s frequency comb can be 
resolved and fit. Otherwise, there will be excess frequency-dependent power seen 
in the microlensing direction. The issue of confusion with local and cosmological 
string loops will be addressed in future studies. 

While nondetection by LISA would place bounds on tension and loop size the 
most informative scenario will be measuring the individual harmonics of gravi- 
tational wave emission. Cusps have almost smooth spectral amplitude and phase 
distributions, kinks show some mode-to-mode variation and k-kinks progressively 
more. The variation with harmonic order provides immediate information regard- 
ing derivative discontinuities, e.g. an experimental determination of cusp and kink 
content. This is directly applicable for understanding the string network in a cosmo- 
logical context. Specifically, the discontinuities are created when horizon-crossing 
strings are chopped into loops but are modified as the strings are stretched out 
by the expanding universe. It is important to understand if the loops exclusively 
contain discontinuities or, also, cusps. This relates to anticipated cosmologically dis- 
tant string sources that contribute to the gravitational wave background currently 
sought. 
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A measurement of the fundamental frequency is equivalent to a precise mea- 
surement of the invariant string length. It can be determined in 1 year to relative 
accuracy at least as good as Afi /f1 ~ 1/psnr. The gravitational decay time for 
the typical string loop is likely to be of order the age of universe. A study of Fig. 10 
suggests that the expected decay in a year Af, /f; ~ 10~1° is too small to measure 
directly unless the loop is unexpectedly close to the end of its life or very nearby. 
Instead, first, one will measure changes in frequency from center of mass motion 
and acceleration within the galaxy and place upper limits on Af) /f1. The latter 
will constrain string couplings to axion-like fields and astrophysical effects of nearby 
matter. Since loops are long-lived sources emitting over a wide range of harmonics, 
eventually, multiple detectors will be brought to bear and the loop’s decay mea- 
sured in a direct fashion. This program can yield a precise determination of the 
string tension. 


9. Summary 


String theory contains a consistent quantum gravity sector and possesses a deep 
mathematical and physical structure. It has become apparent that the theory’s rich- 
ness enables it to describe a huge multiplicity of solutions and that testing whether 
string theory is the theory of nature is no easy task. Fortunately and fundamen- 
tally, it possesses an unambiguous property, namely, the presence of strings. If the 
evolution of the early universe is able to produce stable or metastable closed strings 
that stretch across the sky, then these unusual objects may provide distinctive evi- 
dence for the theory. As more and more observational data have been collected 
and analyzed, most cosmologists now conclude that the universe started with an 
inflationary epoch. This conclusion quite naturally motivates investigations of the 
following interrelated issues: how inflation is realized, how and what type of cosmic 
strings are produced, how to search for and detect these objects and how to measure 
their properties. 

Here, we review how inflation may be realized within string theory and whether 
there are distinctive stringy features in string theory-motivated inflationary scenar- 
ios. In lieu of presenting a full review we sample some of the scenarios proposed 
already to give readers a taste of what can be achieved. As there are many ways to 
realize inflation within the string theory framework, we hope better data will even- 
tually indicate how inflation happens in string theory. As we have pointed out, some 
stringy inflationary scenarios can have interesting distinct features to be detected. 
We are also encouraged to note that cosmic superstrings are naturally produced 
after inflation. 

Next, we review and discuss cosmic superstring features and production. One 
underlying message is that, because of the intricacies of flux compactification, mul- 
tiple types of cosmic superstrings with a wide range of properties may be produced. 
Such superstrings are quite different from the vortices well studied within field the- 
ory. The tension may be drawn from a discrete spectrum and different string states 
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can cooperate to form junctions and beads. A future experimental observation of 
any of these properties can legitimately be taken as a smoking gun for string theory. 

We review ongoing searches for strings in cosmology. Although specific experi- 
mental results are model-dependent the diversity of approaches has jointly lowered 
the upper limit for string tension by about two or more orders of magnitude below 
GUT scales. This is an important progress. However, well-motivated string models 
with warped geometries can yield characteristic string tensions that are far smaller 
than current upper limits, perhaps all the way to the general scales of the Standard 
Model itself. As Feynman famously said about miniaturization, “there’s plenty of 
room at the bottom”. Likewise, for string theory, there is a huge range of plausible 
tensions yet to be explored. 

We review the new physics and cosmology of these relatively low tension cosmic 
strings, particularly the tendency for string loops to track structure formation in the 
matter-dominated eras. As low tension strings decay slow enough, their clustering 
happens like dark matter, with a 10° order of magnitude enhancement in density 
within the galaxy. This opens up intriguing possibilities to seek strings having 
tensions many orders of magnitude less than current upper limits. 

We review one promising possibility that, for tension 10-4 < Gu < 107, string 
loops within the Galaxy microlens stars within the Galaxy. This type of microlens- 
ing qualitatively differs from the well-studied microlensing effect of compact objects 
in the Galactic halo. Ongoing (e.g. GAIA of ESA) as well as planned (e.g. LSST, 
WFIRST) searches for variable stars and/or exo-planets are sensitive to cosmic 
superstring microlensing. This gives us hope that much improved upper limits on 
string tension and/or actual detections will emerge in the coming years. We outline 
the important role that gravitational wave observations might play if and when a 
string loop is detected. 
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We present a bird’s-eye survey on the development of fundamental ideas of quantum 
gravity, placing emphasis on perturbative approaches, string theory, loop quantum grav- 
ity (LQG) and black hole thermodynamics. The early ideas at the dawn of quantum 
gravity as well as the possible observations of quantum gravitational effects in the fore- 


seeable future are also briefly discussed. 


Keywords: Perturbative quantum gravity; string theory; loop quantum gravity; black 


hole thermodynamics; quantum gravity phenomenology. 


PACS Number: 04.60.—m 


1. Prelude 


Quantum gravity is the research that seeks a consistent unification of the two 
foundational pillars of modern physics — quantum theory and Einstein’s theory 
of general relativity. It is commonly considered as the paramount open problem 
of theoretical physics, and many fundamental issues — such as the microscopic 
structure of space and time, the origin of the universe, the resolution of spacetime 
singularities, etc. — relies on a better understanding of quantum gravity. 
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The quest for a satisfactory quantum description of gravity began very early. 
Einstein after proposing general relativity thought that quantum effects must mod- 
ify general relativity in his first paper on gravitational waves in 1916! (although he 
switched to a different point of view working on the unification of electromagnetism 
and gravitation in the 1930s). Klein argued that the quantum theory must ulti- 
mately modify the role of spatiotemporal concepts in fundamental physics? * and 
his ideas were developed by Deser.° With the interests and developments of Rosen- 
feld, Pauli, Blokhintsev, Heisenberg, Gal’perin, Bronstein, Frenkel, van Dantzig, 
Solomon, Fierz and many other researchers, three approaches to quantum gravity 
after World War II had already been enunciated in 1930s pre-World War II as 
summarized by Stachel®: 


(i) Quantum gravity should be formulated by analogy with quantum electrody- 
namics (Rosenfeld, Pauli and Fierz). 

(ii) The unique features of gravitation will require special treatment — the full 
theory with its nonlinear field equations must be quantized and generalized 
as to be applicable in the absence of a background metric (Bronstein and 
Solomon). 

(iii) General relativity is essentially a macroscopic theory, e.g. a sort of thermody- 
namics limit of a deeper, underlying theory of interactions between particles 
(Frenkel and van Dantzig). 


Many ideas of the early time continue to provide valuable insight about the nature 
of quantum gravity even today. For example, the 1939 work on linearized general 
relativity as a spin-2 field by Fierz and Pauli’ inspired a recent development on 
massive gravity, bimetric gravity, etc. 

Modern work on quantum gravity, however, did not really start-off until the 
development of a canonical formalism in 1959-1961 by Arnowitt, Deser and Misner 
(ADM) for the case of asymptotically flat boundary conditions® 1° (for a review, see 
Ref. 19). This served as the basis for Schoen and Yau’s proof that the classical energy 


is bounded below??:?! 


and is the starting point for numerical simulations of classical 
general relativity. Of course the ADM formalism also provides the Hamiltonian and 
canonical variables whose quantization defines quantum general relativity. However, 
the sacrifice of manifest covariance made perturbative computations prohibitively 
difficult, although many strategies to tackle this difficulty have been suggested, e.g. 
in loop quantum gravity (LQG). 

Ninety-nine years passed since the very first conception of Einstein. Today, 
quantum gravity has grown into a vast area of research in many different (both 
perturbative and nonperturbative) approaches. Many directions have led to sig- 
nificant advances with various appealing and ingenious ideas; these include causal 
sets, dynamical triangulation, emergent gravity, H-space theory, LQG, noncom- 
mutative geometry, string theory, supergravity, thermogravity, twistor theory and 
much more. (For surveys and references on various approaches of quantum gravity, 
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see Refs. 22-25; for detailed accounts of the history and development, see Refs. 6 
and 26; for a popular science account of quantum gravity, see the book Ref. 27.) 


2. Perturbative Quantum Gravity 


The quantization of general relativity on the perturbative level (with or without 
matter) was the life work of DeWitt, for which he was awarded the 1987 Dirac Medal 
and the 2005 Einstein Prize.?® °° His program eventually succeeded — against the 
expectation of most expert opinion during the 1960s — in generalizing Feynman’s 
covariant quantization of quantum electrodynamics so that it could be applied as 
well to theories such as general relativity and Yang-Mills that are based on non- 
Abelian gauge symmetries. There were three key steps: 


(i) The introduction of the background field method for representing the effective 
action as a gauge invariant functional of the fields.?93! This made clear the 
connection between invariant counterterms and all possible ultraviolet diver- 
gences of scattering amplitudes at a fixed order in the loop expansion. 

(ii) The realization that non-Abelian gauge symmetries require the inclusion of 
opposite-normed ghost fields to compensate the effect of unphysical polariza- 
tions in loop corrections.?93? 

(iii) The development of invariant regularization techniques, the first of which was 
dimensional regularization.**:34 This permitted efficient computations to be 
made without the need for noninvariant counterterms. 


The first application of these techniques was made in 1974 by ’t Hooft and 
Veltman.*° They showed that general relativity without matter has a finite S-matrix 
at one-loop order, but that the addition of a scalar field leads to the need for higher 
curvature counterterms that would destabilize the universe. Very shortly thereafter 
Deser and van Nieuwenhuizen showed that similar unacceptable counterterms are 
required at one-loop for general relativity plus Maxwell’s electrodynamics,°®?" for 
general relativity plus a Dirac fermion®® and (with Tsao) for general relativity plus 
Yang—Mills.°° The fate of pure general relativity was settled in 1985 when Goroff 
and Sagnotti showed that higher curvature counterterms are required at two-loop 
onder!’ = 

These results show that quantum general relativity is not fundamentally con- 
sistent as a perturbative quantum field theory. Opinion is divided as to whether 
this means that general relativity must be abandoned as the fundamental theory 
of gravity or whether quantum general relativity might make sense nonperturba- 
tively. String theory discards general relativity, and does not even employ the met- 
ric as a fundamental dynamical variable. LQG and causal dynamical triangulation 
are approaches that attempt to make sense of quantum general relativity without 
employing perturbation theory. At this time it would be fair to say that there is 
not yet any fully successful quantum theory of gravity. 
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But being unable to do everything is not at all the same as lacking the ability 
to do anything. Perturbative quantum general relativity (with or without matter) 
can still be used in the standard sense of low-energy effective field theory,**-44 in 
the same way that Fermi theory was for years employed to understand the weak 
interactions, even including loop effects.*°4® The most interesting effects tend to 
occur in curved spacetime backgrounds, on which one can also consider quantizing 
only matter, without dynamical gravity.*” 

When working on curved backgrounds one often finds the formalism of flat space 
scattering theory to be inappropriate because it is based on incorrect assumptions 
about free vacuum at very early times (which actually began with an initial sin- 
gularity) and at very late times (which may be filled with ensembles of particles 
created by the curved geometry). Solving the effective field equations under these 
conditions is especially troublesome because the in-out effective field equations at 
a spacetime point 2” contain contributions from points x/" far in its future, and 
because even the in-out matrix elements of Hermitian operators such as the metric 
and the Maxwell field strength tensor can develop imaginary parts. For these rea- 
sons it is often more appropriate to employ the Schwinger—Keldysh formalism. This 
is a diagrammatic technique that is almost as simple to use as the standard Feyn- 
man rules, which gives true expectation values instead of in-out matrix elements. 
It was devised for quantum mechanics in 1960 by Schwinger.*® Over the next few 
years it was generalized to quantum field theory by Mahanthappa and Bakshi,*® >! 
and to statistical field theory by Keldysh.°? Although Schwinger—Keldysh effective 
field equations are nonlocal, they are causal — in the sense that only points 2x’” 
in the past of 2” contribute — and the solutions for Hermitian fields are real.°*>4 
They are the natural way to study quantum effects in cosmology®> and nonequilib- 
rium quantum field theory.°® Weinberg recently devised a variant of the formalism 
which is especially adapted to computing the correlation functions of primordial 
inflation.” °* 

For a recent review of perturbative quantum gravity, see Ref. 59. 


3. String Theory 


On the other hand, the search for a completely consistent theory of quantum grav- 
ity has never stopped. Although it does not embrace general relativity at the foun- 
dational level and it is disputable whether it can be genuinely formulated in a 
background-independent, nonperturbative fashion, string theory is arguably consid- 
ered by many as the most promising candidate for a consistent theory of quantum 
gravity on the grounds that its low-energy limit surprisingly gives rise to (the super- 
symmetric extension of) the theory of (modified) gravity plus other force and matter 
fields. In the framework of string theory, the point-like particles of particle physics 
are replaced by (the different quantum excitations of) minuscule one-dimensional 
objects called strings. In addition to being a quantum theory of gravity, string 
theory also aims to unify all fundamental forces and all forms of matter, striving 
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for the ultimate goal of being a “theory of everything”. (For textbooks on string 
theory, see Refs. 60-62; for a detailed account of its history and development, see 
Ref. 63.) 

String theory was born in the late 1960s as a never completely successful theory 
of the strong nuclear force and was later recognized as a suitable framework for a 
quantum theory of gravity (see Ref. 64 for its “prehistory” ). The idea of identifying 
the string action as the area of the worldsheet of the string traveling in spacetime 
was introduced independently by Nambu,®° Goto® and Hara®’ in the early 1970s. 
The modern treatment of string theory based on the worldsheet path integral of the 
Polyakov action was initiated by Polyakov in 1981 (see Refs. 68 and 69) and has led 
to an intimate link with the conformal field theory (CFT). The worldsheet CFT in 
the presence of spacetime background fields was studied by Callan et al.”°; the con- 
formal invariance demands beta functions of the worldsheet field theory to vanish 
identically and consequently yields the equations of motion of the background fields 
that bear remarkable resemblance to the Einstein field equation and (non-Abelian) 
Maxwell’s equations with higher-order corrections. 

Quantization of the bosonic string requires the number of spacetime dimen- 
sions to be 26, of which 22 extra spatial dimensions are thought to be compact- 
ified in the deep microscopic scale and thus undetectable at low energies. The 
bosonic string theory is unsatisfactory in two aspects: First, it entails existence of 
negative-normed tachyon fields and consequently is nonunitary and inconsistent; 
second, it does not contain fermions and thus cannot account for the quarks and 
leptons in the standard model. In the 1980s, bosonic string theory was extended 
into superstring theory, which incorporates both bosonic and fermionic degrees 
of freedom via supersymmetry and requires six extra spatial dimensions to be 
compactified on a Calabi-Yau manifold. Superstring theory not only gets rid of 
negative-normed fields but also includes fermions as desired. There are basically 
two (equivalent) approaches to embody supersymmetry in string theory: (i) The 
Ramond—Neveu-Schwarz (RNS) formalism with manifest supersymmetry on the 
string worldsheet"!'”?; (ii) the Green-Schwarz (GS) formalism with manifest super- 
symmetry on the background spacetime.’”* The first superstring revolution began 
in 1984 with the discovery by Green and Schwarz”® that the cancellation of gauge 
and gravitational anomalies demands a very strong constraint on the gauge symme- 
try: The gauge group must be either SO(32) or Eg x Eg. Eventually, five consistent 
but distinct superstring theories were found: Type I, type IIA, type IB, SO(32) 
heterotic and Eg x Eg heterotic. 

In the late 1980s, it was realized that the two type I] theories and the two 
heterotic theories are related by T-duality, which, simply speaking, maps a string 
winding around a compactified dimension of radius R to that of radius (2/R, where 
é, is the string length scale. (For reviews on T-duality, see Refs. 77 and 78.) A few 
years later, S-duality was discovered as another kind of duality that maps the string 
coupling constant g, to 1/g;. The two basic examples are the duality that maps the 
type I theory to the SO(32) heterotic theory”? and the duality that maps the type 
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IIB theory to itself.8° As S-duality relates a strongly coupled theory to a weakly 
coupled theory, it provides a powerful tool to explore nonperturbative behaviors 
of a theory with g; > 1, given the knowledge of the dual theory perturbatively 
obtained with g; < 1. In the mid-1990s, understanding of the nonperturbative 
physics of superstring theory progressed significantly, revealing that superstring 
theory contains various dynamical objects with p spatial dimensions called p-branes, 
in addition to the fundamental strings. Particularly, type I and II theories contain 
a class of p-branes, known as D-branes (or Dp-branes, more specifically), upon 
which open strings can end in Dirichlet boundary conditions. In 1995, Polchinski 
identified D-branes as solitonic solutions of supergravity that are understood as 
(generalized) black holes,®! a landmark discovery that heralded the prominence of 
D-brane dynamics. 

Furthermore, studies on the S-duality of type II5?-8% and Eg x Eg heterotic 
theories sprang a big surprise: These two theories grow an eleventh dimension of 
size gsfs at strong coupling. These discoveries together with the aforementioned 
relations between different superstring theories via T- and S-dualities brought about 
the second superstring revolution that took place around 1994-1997, suggesting the 


84,85 


existence of a more fundamental theory in 11 dimensions. The existence of such a 
theory was first conjectured and named as M-theory by Witten at a string theory 
conference at the University of Southern California in 1995. M-theory is supposed 
to be the fundamental theory of everything, of which the five distinct superstring 
theories along with the eleven-dimensional theory of supergravity are regarded as 
different limits. Although the complete formulation of M-theory remains elusive 
(and whether “M” should stand for “mother”, “magic”, “mystery”, “membrane”, 
“matrix”, etc. remains a matter of taste), a great deal about it can be learned 
by virtue of the fact that the theory should describe two- and five-dimensional 
objects known as M-branes (M2- and M5-branes, respectively) and reduce to eleven- 
dimensional supergravity theory at the low-energy limit. Investigations of M-theory 
have inspired a great number of important theoretical results in both physics and 
mathematics. 

A collection of coincident D-branes in string theory or M-branes in M-theory 
produces a warped spacetime with flux of gauge fields akin to a charged black hole, 
as the branes are sources of gauge flux and gravitational curvature. The low-energy 
limit of the gauge theory on the branes’ worldvolume (referred to as “boundary” ) is 
found to describe the same physics of string theory or M-theory in the near-horizon 
geometry (referred to as “bulk”). In this way, one is led to conjecture a remark- 
able duality that relates conventional (nongravitational) quantum field theory on 
the boundary to string theories or M-theory on the bulk. This gauge theory /string 
theory duality is often referred to as anti-de Sitter (Ads)/CFT correspondence, 
which was first spelled out in a seminal paper by Maldacena in 1997.°° (A detailed 
review on AdS/CFT correspondence was given in Ref. 87.) By considering N coin- 
cident D3-branes in the type IIB theory, one obtains the celebrated example: The 
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type IIB theory in Ad$; x S° (the product space of five-dimensional anti-de Sitter 
space and five-dimensional sphere) is equivalent to the SU(N) super Yang-Mills 
theory with NV = 4 supersymmetry on the four-dimensional boundary (which is a 
superconformal field theory). Many other examples have also been substantiated 
to various degrees of rigor. In the attempt to construct the complete formulation 
of M-theory, the AdS/CFT correspondence sets out a new strategy based on the 
holographic principle, which posits that the physics of a bulk region is completely 
encoded on its lower-dimensional boundary, as originally propounded by ’t Hooft 
in 1993°° and elaborated by Susskind in 1994.°° 

In string theory, the shape and size of the compactified manifold and conse- 
quently the fundamental physical constants (lepton masses, coupling constants, 
cosmological constant, etc.) are dynamically determined by vacuum expectation 
values of scalar (moduli) fields. The crucial question arises known as the moduli- 
space problem or moduli-stabilization problem: What mechanism stabilizes the com- 
pactified manifold and uniquely determines the physical constants? Answers to this 
problem have been proposed in the approach of flux compactifications as a general- 
ization of conventional Calabi-Yau compactifications, which were first introduced 
by Strominger in 1986°° and by de Wit et al. in 19879! and now have become a 
rapidly developing area of research. The idea is to generate a potential that sta- 
bilizes the moduli fields by compactifying string theory or M-theory on a warped 
geometry. In a warped geometry, the fluxes associated with certain tensor fields 
thread cycles of the compactified manifold; as the magnetic flux of an n-form field 
strength through an n-cycle depends only on the homology of the n-cycle, the flux 
stabilizes due to the flux quantization condition and gives rise to a nonvanishing 
potential. Flux compactifications were first studied in the context of M-theory by 
Becker and Becker in 1996.9? (A comprehensive review on flux compactifications 
was given in Ref. 93.) 

In 1999, Gukov et al. made it evident that flux compactfications generate non- 
vanishing potential for moduli fields, leading to a solution to the moduli-space 
problem.°* However, since the fluxes can take many different discrete values over 
different homology cycles of different (generalized) Calabi-Yau manifolds, flux 
compactifications typically yield a vast multitude (commonly estimated to be of 
the order 10°°° (see Ref. 95)) of possible vacuum expectation values, referred to as 
the string theory landscape, and therefore we have to abandon the long-cherished 
hope that the fundamental physical constants are supposed to be uniquely fixed by 
string theory or M-theory. In response to this problem, Susskind in 2003°° proposed 
the anthropic argument of the string theory landscape as a concrete implementa- 
tion of the “anthropic principle”, which suggests that physical constants take their 
values not because fundamental laws of physics dictate so but rather because such 
values are at the loci of the landscape that are suitable to the existence of (intelli- 
gent) life (in order to measure these constants). The properties of the string theory 
landscape was shortly analyzed in a statistical approach by Douglas.®° The scientific 
relevance of the anthropic landscape has sparked fierce debate and remained highly 


II-332 S. Carlip et al. 


controversial, yet it has gradually gained popularity (especially in the context of 
cosmology). 

The application of flux compactifications as well as the string theory landscape 
to the study of cosmology has spawned a whole new growing research area known 
as string cosmology. A notable subfield of it is brane cosmology, which posits that 
our four-dimensional universe is restricted to a brane inside a higher-dimensional 
space, as first proposed by Randall and Sundrum in 1999.97 The primary goal of 
string cosmology is to place field-theoretic models of cosmological inflation on a 
firmer logical ground from the perspective of string theory or, more boldly, to open 
entirely new realms of pre-big bang scenarios that cannot be described by any field- 
theoretic models. In 2003, Kachru et al. outlined the construction of metastable 
de Sitter vacua®® and shortly investigated the application to inflation.°® String 
cosmology has made some concrete predictions that will soon be confronted with 
the near-future astrophysical observations. (For reviews on string cosmology, see 
Refs. 100 and 101 and the textbook Ref. 102.) 


4. Loop Quantum Gravity 


The primary competitor of string theory for a complete quantum theory is LQG. 
In sharp contrast to the ambitious goal of string theory, LQG does not intend to 
pursue a theory of everything that unifies all force and matter fields but deliberately 
adopts the “minimalist” approach in the sense that it focuses solely on the search 
for a consistent quantum theory of gravity without entailing any extraordinary 
ingredients such as extra dimensions, supersymmetry, etc. (although many of these 
can be incorporated compatibly). The beauty of LQG lies in its faithful attempt 
to establish a conceptual framework whereby the apparently conflicting tenets of 
quantum theory and Einstein’s theory of general relativity conjoin harmoniously, 
essentially in a nonperturbative, background-independent fashion. (For textbooks 
on LQG, see Refs. 103-105; for a detailed account of its history and development, 
see Ref. 106; for a recent review of LQG, see Ref. 107.) 

The research of LQG originated from the works by Ashtekar in 1986-1987 ,10%:109 
which reformulated Einstein’s general relativity into a new canonical formalism in 
terms of a self-dual spinorial connection and its conjugate momentum, now known 
as Ashtekar variables, casting general relativity in a language closer to that of 
the Yang-Mills gauge theory. The shift from metric to Ashtekar variables pro- 
vided the possibility of employing nonperturbative techniques in gauge theories 
and opened a new avenue eventually leading to LQG. Shortly thereafter, Jacobson 
and Smolin discovered that Wilson loops of the Ashtekar connection are solutions 
of the Wheeler-DeWitt equation (the formal equation of quantum gravity).!!° This 
remarkable discovery led to the “loop representation of quantum general relativity” 
introduced by Rovelli and Smolin in 1988-1990.1!!:1!? The early developments of 
LQG were reviewed in Ref. 113. 
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In the 1990s, by the works of Briigmann et al.,1141! it became clear that inter- 
sections of loops are essential for the consistency of the theory and quantum states 
of gravity should be formulated in terms of intersecting loops, i.e. graphs with 
links and nodes. Later on, inspired by Penrose’s speculation on the combinatorial 
structure of space,!!® Rovelli and Smolin!” obtained an explicit basis of states of 
quantum geometry known as spin networks, which are oriented graphs labeled by 
numbers associated with spin representations of SU(2) on each link and node. The 
mathematical construction of spin networks was systematized by Baez.1!®1!9 The 
idea that LQG could predict discrete quantum geometry was first suggested in the 
study of “weave states”.!2° In 1995, the area and volume operators were defined 
upon spin networks and their eigenvalue spectra were found to be discrete,!?! show- 
ing that quantum geometry is indeed quantized in the Planck scale as had long been 
speculated. Rigorous and systematic studies of geometry operators and their eigen- 
values were further elaborated (e.g. see Refs. 122-124). 

The physical interpretation of spin networks (or more precisely, diffeomorphism- 
invariant knot classes of spin networks, known as s-knots) is extremely appealing: 
They represent different quantized three-dimensional geometries, which are not 
quantum excitations in space but of space. This picture manifests the paradigm of 
background independence of general relativity, as any reference to the localization 
of spin networks is dismissed. Furthermore, inclusion of matter fields into LQG 
does not require a major revamp of the underlying framework. (For a systematic 
account of inclusion of matter, see Ref. 104.) The quantum states of space plus 
matter naturally extend the notion of spin networks with additional degrees of 
freedom. In the presence of matter fields, the background independence becomes 
even more prominent, as geometry and matter fields are genuinely on the equal 
footing and reside on top of one another via their contiguous relations in the spin 
network without any reference to a given background. 

While kinematics of LQG is well understood in terms of spin networks, its 
dynamics (i.e. evolution of spin networks) and low-energy (semiclassical) physics 
is much more difficult and unclear. An anomaly-free formulation of the quantum 
dynamics was first obtained by Thiemann in 1996,!?° and shortly the formulation 
was fully developed in his remarkable “QSD” series of papers.!?® 18 Since then, a 
great number of variant approaches have been pursued, but the quantum dynamics 
remains very obscure, largely because implementation of the quantum Hamiltonian 
constraint is very intricate. The Master constraint program}? initiated by Thie- 
mannn proposed an elegant solution to the difficulties and has evolved into a fully 
combinatorial theory known as algebraic quantum gravity (AQG).18° 183 Mean- 
while, various techniques, notably by the idea of holomorphic coherent states ,'34 1%° 
have been devised to investigate the low-energy physics. Up to now, LQG has 
achieved considerable progress in understanding the quantum dynamics and pro- 
viding contact with the low-energy physics. 
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Applying principles of LQG to cosmological settings leads to the symmetry- 
reduced theory known as loop quantum cosmology (LQC), which was originally 
proposed by Bojowald in 1999 and reached a rigorous formulation around 2006. 
Providing a “bottom-up” approach to the full theory of LQG, LQC has become 
the most well-developed subfield of LQG and led to many significant successes. Most 
notably, it suggests a new cosmological scenario where the big bang is replaced by 
the quantum bounce, which bridges the present expanding universe with a preexis- 
tent contracting counterpart. Absence of cosmological singularities has been shown 
to be robust for a great variety of LQC models, therefore affirming the long-held 
conviction that singularities in classical general relativity should be resolved by the 
effects of quantum gravity. (For textbooks and reviews on LQC, see Refs. 140-143.) 

The standard formalism of LQG adopts the canonical (Hamiltonian) approach 
but is closely related to the covariant (sun-over-histories) approach of quantum 
gravity known as the spin foam theory. The concept of a spin foam, which can be 
viewed as a (discrete) “worldsurface” swept out by a spin network traveling and 
transmuting in time, was introduced in 1993 as inspired by the work of Ponzano— 
Regge model!“4 
ity by Perez et al. in the 2000s. A spin foam represents a quantized spacetime in 


and later developed into a systematic framework of quantum grav- 


the same sense that a spin network represents a quantized space; the transition 
amplitude from one spin network to another is given as the discrete sum (with 
appropriate weights) over all possible spin foams that connect the initial and final 
spin networks. (For reviews on the spin foam theory, see Refs. 145-148.) 

LQG has grown into a very active research field pursued in many directions, both 
in the traditional canonical approach and in the covariant (i.e. spin foam) approach. 
Recently, the precise connection between the canonical and covariant approaches 
has become one of the central topics of LQG. The merger of different approaches 
has yielded profound insight and suggested a new theoretical framework referred 
to as covariant LQG, which provides new conceptual principles and could pave a 
royal road to a complete theory of quantum gravity as advocated by Rovelli!4® 15! 
(also see Ref. 105 for a comprehensive account). 


5. Black Hole Thermodynamics 


Perhaps the most notable achievement in the study of quantum gravity so far 
was the discovery that black holes are not really black, but emit thermal Hawking 
radiation. The surprising fact that black holes behave as thermodynamic objects 
has radically affected our understanding of general relativity and given valuable 
hints about the nature of quantum gravity. 

The study of black hole thermodynamics can be traced back to 1972 when 
152 __ a prop- 
erty reminiscent of the second law of thermodynamics. The resemblance between 
black hole mechanics and thermodynamics was further enhanced by the discovery 
of analogs of other thermodynamical laws and formally elaborated in the paper by 


Hawking proved that an area of an event horizon can never decrease 
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Bardeen et al.!®? in terms of the “four laws of black hole mechanics”, which are 
in clear parallel with the four usual laws of thermodynamics as the surface gravity 
(«) plays the role of temperature and the area of horizon (Ajor) of entropy. Around 
the same time, considering a series of thought experiments initiated by his advisor, 
Wheeler,!** Bekenstein argued that the black hole entropy has to be of the form 
Spu = nkpAnor/(hG), where kg is Boltzmann’s constant and 7 is a proportional 
constant of order one, and correspondingly obtained the “generalized second law of 
thermodynamics”, which asserts that the total of ordinary entropy of matter plus 
the black hole entropy never decreases.!°> 1°? 

In 1974, using the newly developed techniques of quantum field theory in curved 
spacetime, Hawking!°*:!59 showed that all black holes radiate as black bodies with 
the Hawking temperature Ty = fix /(27). The first law of black hole mechanics 
then determines the entropy as Spy = +kp Anor /(RG), which confirms Bekenstein’s 
expression with 7 = 1/4 and is often referred to as the Bekenstein-Hawking for- 
mula. As the equivalence principle implies that the gravitational field near a black 
hole horizon is locally equivalent to uniform acceleration in a flat spacetime, it 
is expected that an accelerated observer should perceive an effect similar to the 
Hawking radiation. In 1976, Unruh!®° demonstrated that an observer moving with 
a constant proper acceleration a in Minkowski spacetime indeed see a thermal flux 
of particles with the Unruh temperature Ty = ha/(27), which is in almost exact 
analogy with the Hawking temperature. At almost the same time, Bisognano and 
Wichmann gave an independent and mathematically rigorous proof of the Unruh 
effect in quantum field theory.'! To resolve the worrying issue whether the flux of 
particles formally obtained via the standard quantum field-theoretical definitions 
of vacuum states and particle numbers can be physically interpreted as “particles” 
as seen by detectors, Unruh!© and DeWitt!®? devised simple models of particle 
detectors and showed that the answer is affirmative for these detectors. 

The Hawking temperature and the Bekenstein—Hawking entropy are inherently 
quantum gravitational, in the sense that they depend explicitly on both Planck’s 
constant fi and Newton’s constant G. They are also surprisingly universal. The 
entropy, for example, depends only on the horizon area, and takes the same simple 
form regardless of the black hole’s charges, angular momentum, horizon topology 
and even the number of spacetime dimensions. The thermal properties of black 
holes have not been directly observed, but by now they have been derived in 
so many different ways — from Hawking’s original calculation of quantum field 
theory in curved spacetime and Unruh’s approach of an accelerated observer to 
161,163-165 


quantum tunneling,!67168 
175-178 


Euclidean partition functions, canonical quantization in Hamiltonian for- 


166 


mulation, anomaly techniques,!©? 1” the computation 


of pair production amplitudes and many others developed since early on to 
very recently — that their existence seems very nearly certain. 

The natural question is then whether black hole thermodynamics, like the ordi- 
nary thermodynamics of matter, has a microscopic “statistical mechanical” expla- 


nation. In 1996, using methods based on D-branes and string duality in string 
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theory, Strominger and Vafa suggested a strategy of “turning down” the strength 
of the gravitational interaction until a black hole becomes a weakly coupled sys- 
tem of strings and D-branes so that at weak coupling one can count the number 
of states; this strategy led to the same Bekenstein—Hawking entropy for extremal 
(i.e. maximally charged) supersymmetric (BPS) black holes in five dimensions.'7? 
This remarkable result was quickly extended to a large number of extremal, near- 
extremal and some particular nonextremal black holes.!®°:!8! However, it becomes 
harder and uncertain to obtain the right factor 7 = 1/4 for far-from-extremal black 
holes (the Schwarzschild black hole is a typical case). This issue is currently under 
intense investigation in various approaches but has yet to be fully elucidated. 

In 2005, Mathur proposed running the analysis backwards: Starting at weak 
coupling with a particular collection of strings and D-branes, one then turns the 
gravitational coupling up and see what geometry appears at strong coupling.!®? 
The result is typically not a black hole but instead a “fuzzball” — a configuration 
with no horizon and no singularity but with a geometry akin to that of a black hole 
outside a would-be horizon.!8* In a few special cases, one can count the number of 
such classical fuzzball geometries and reproduce the Bekenstein—Hawking entropy. 
In general, however, many of the relevant states may not have classical geometric 
descriptions. 

In the context of string theory, the notable AdS/CFT correspondence also pro- 
vides a feasible scheme to derive the black hole entropy. For asymptotically Ads 
black holes, one can in principle compute the entropy by counting states in the (non- 
gravitational) dual CFT. The most straightforward application of this correspon- 
dence is for the (2+1)-dimensional BTZ black hole.!** In 1997-1998, Strominger!®* 
and Birmingham et al.!®° independently computed the BTZ black hole entropy, 
which precisely reproduces the Bekenstein—Hawking expression. Since many higher- 
dimensional near-extremal black holes have a near-horizon geometry of the form 
BTZ x trivial, the BTZ results can be used to obtain the entropy of a large class of 
string theoretical black holes.!87 

The second major research program to derive the black hole entropy from a 
microscopic picture is LQG. The suggestion that counting the quanta of area in 
LQG could offer the statistical description of black hole thermodynamics was first 
proposed by Krasnov in 1997.1°° The idea essentially is that the statistical ensemble 
is composed of the microstates of the horizon geometry that give rise to a speci- 
fied total area. By introducing the concept of “isolated horizons”, Ashtekar et al. 
rigorously developed the framework of the idea and derived the right Bekenstein— 
Hawking formula for the Schwarzschild black hole.18? 1% 
extended to other black holes with rotation, distortion, etc. 

In 2004, the combinatorial problem of counting the microscopic states in LQG 
was rephrased in a more manageable way by Domagala and Lewandowski! and by 
Meissner.!9? These works corrected a flawed assumption considered true for several 
years and enabled one to compute the formula for Ajo, >> Gh to the subleading 


These results were soon 
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order as kp iS= Er ae - In — ++++, by which the Barbero—Immirzi parameter 7 
is fixed as y = ym ®& 0.23653. For small areas, the precise number counting was first 
suggested in 2006209-70! and later thoroughly investigated by employing combina- 
torial methods (see Ref. 202 for a detailed account and Ref. 203 for a review). The 
key discovery is that, for microscopic black holes, the so-called black hole degener- 
acy spectrum when plotted as a function of the area exhibits a striking “staircase” 
structure, which makes contact in a nontrivial way with the evenly-spaced black 
hole horizon area spectrum predicted by Bekenstein and Mukhanov.?°* Recently, 
an explicit SU(2) formulation for the black hole entropy has been developed by 
Engle et al. based on covariant Hamiltonian methods,?0> 207 
port for the earlier proposal that the quantum black hole degrees of freedom could 
be described by an SU(2) Chern—Simons theory. (Also see the review Ref. 203 and 
references therein for other recent advances.) 

In addition to string theory and LQG, we now have a number of statistical 
mechanical approaches to black hole thermodynamics. These include entanglement 
entropy,7°> 2/2 induced gravity?! 75 and many others. While they may differ on 
the subleading corrections,?!° they all seem to give the correct Bekenstein-Hawking 
Ane — q In a +.---, where the coefficient 


4Gh 
a depends on the quantum theory.) A new puzzle is why such a diverse set of 


giving rigorous sup- 


entropy. (In general, one expects kp Ig = 


quantum gravitational approaches should all agree. (There are some indications 
that even the subleading corrections might be universal, up to differences on the 
treatment of angular momentum and conserved charges.?!°) 

Even more puzzling is the “information loss problem”: If a black hole is formed 
by collapse of matter in a pure quantum state, how can the seemingly thermal final 
state be compatible with unitary evolution? This question is currently the subject 
of intense research, and may yield surprising new insights into the nature of space 
and time. (For a detailed account of the information paradox, see Ref. 217.) 

For a recent review of black hole thermodynamics, see Ref. 218. 


6. Quantum Gravity Phenomenology 


Quantum gravity is often denounced as merely a theoretical enterprise that has no 
direct contact with experimental or observational realms, as quantum gravitational 
effects are appreciable only at the Planck scale, which is thought to be completely 
out of current reach. Today, as idea flourishes and technology advances drastically, 
the situation could be changed soon and quantum gravity might become reachable 
in the foreseeable future. 

Many quantum gravity models suggest departures from the equivalence prin- 
ciple, CPT symmetry and/or local Lorentz invariance in the deep Planck regime; 
among them the specific examples are string theory,?!9 braneworld scenarios,?2° 
LQG and spin foam theory,?21??? noncommutative geometry,2?? 22° 
ity,22° etc. Deviations from the symmetries at the Planck scale in turn modify the 


emergent grav- 
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pure metric spacetime structure and the local Lorentzian energy-momentum dis- 
persion relation at lower-energy scale. 

Modifications of the pure metric spacetime structure implies deviations from 
the Einstein Equivalence Principle (EEP). The most stringent test of EEP comes 
from cosmic observations. The null result of birefringence in cosmic propagation of 
polarized photons and polarized gamma rays so far has ascertained the light-cone 
structure and the core metric to ultrahigh precision of 10~°°; this high precision 
already probes the second-order in the ratio of W-boson mass or proton mass to the 
Planck mass.?2” To this precision, the only freedoms left over regarding the metric 
structure are a scalar degree of freedom (dilaton) and a pseudo-scalar degree of 
freedom (axion). The dilaton alters the amplitude while the axion rotates the linear 
polarization of the cosmic propagation. Based on the cosmic propagation since the 
last scattering surface of the cosmic microwave background (CMB), the fractional 
variation of the dilaton degree of freedom is constrained to 8 x 10~* by the precision 
agreement of the CMB spectrum to the black-body spectrum.?2° The cosmic axion 
degree of freedom is constrained by comic polarization rotation (CPR): Uniform 
CPR to less than 0.02 (rad);?2° CPR fluctuations to 0.02 (rad).??9:28° 

Observations on ultrahigh energy cosmic Rays (UHECR)??! 
kinetic energy greater than 10'5 eV, the most energetic particles ever observed — 
provide a chance to see possible modifications to the local Lorentzian energy— 
momentum dispersion relation of ultrarelativistic (v ~ c) particles. The UHECR 
spectrum measured at the Pierre Auger Observatory thus far has put very strong 


— cosmic rays with 


constraints on deviations from the Lorentzian dispersion relation.??? Future investi- 
gation with improved precision will either impose even stronger constraints or pick 
up signals of new physics including quantum gravitational effects (see Refs. 233 and 
234 for detailed surveys). 

Similarly, observations on propagation of ultrahigh energy cosmic electromag- 
netic waves can be used to test any momentum dependence of the speed of photons, 
as an indication of a breakdown of the Lorentz symmetry. Particularly, the detailed 
analysis of the relationship between the arrival time, photon momentum and red- 
shift of gamma-ray bursts is approaching very closely to the desired sensitivity to 
the Planck-scale physics.??° 

On the other hand, the ultraprecise measurements of “atom-recoil frequency” 
in the cold-atom experiments allow us to probe the energy-momentum dispersion 
relation of nonrelativistic (v < c) particles with Planck-scale sensitivity, thus pro- 
viding revealing insight into various types of modifications to the dispersion relation 
considered in the quantum gravity literature.736?3" Combining studies of the two 
complementary regimes of the nonrelativistic and ultrarelativistic dispersion rela- 
tions would tell us a great deal about the nature of quantum spacetime.??” 

In the context of astrophysics, many theories — such as attractor theory,?3° the 
DGP model,??9 massive gravity,?4° et 
in the solar system dynamics. None of these effects have been observed yet, but 


c. — predict the effects of quantum gravity 
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they might be testable in the proposed space missions designed to test relativis- 


241,242 and/or to detect and measure gravitational waves by using laser 


tic gravity 
interferometry (see Ref. 243 for a detailed survey on space-based gravitational- 
wave detectors). Sensitive tests of the equivalence principle, local Lorentz invari- 
ance, modifications to Einstein’s field equations, etc. might also be possible in the 
coming space missions.744 

In the context of cosmology, quantum effects in the pre-inflationary era may 
give rise to sufficient deviations from the standard inflationary scenario and leave 
footprints on the CMB. In string theory, various observable imprints on anisotropies 
and polarization of the CMB have been suggested by different models of string 
cosmology and by the existence of cosmic strings (see Ref. 102 for a review). In 
LQG, the standard inflationary scenario is extended to the epoch from the big 
bounce in the Planck era to the onset of slow-roll inflation.?4° Some research works 
have attempted to reveal possible observable imprints of loop quantum effects on 
the CIM. 225-25" 

There are a few experiments currently in operation or in ongoing development, 
aiming to make accurate measurements of anisotropies and polarization of the CMB, 
and their results will soon lead to great excitement. Very recently, the joint analysis 
of BICEP2/Keck Array and Planck data found no statistically significant evidence 
for tensor modes in the B-mode polarization, yielding an upper limit r < 0.12 at 
95% confidence on the tensor-to-scalar ratio r.?°? 
out many “large-r scenarios” of pre-inflationary models, including those in string 
cosmology. The conclusive result of B-mode polarization expected to arrive in the 
near future will further discriminate between different quantum gravity theories. 


This upper limit already ruled 


There are many other cosmological and astrophysical observations as well as 
laboratory experiments that could reveal possible quantum gravitational effects. 
At the turn of the centennial of the birth of Einstein’s general relativity, quantum 
gravity phenomenology will in time become an important area of relevant research. 
(For a comprehensive survey on quantum gravity phenomenology, see Ref. 253; also 
see Ref. 254 for a survey from the perspective of LQG.) 
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I argue that cosmological data from the epoch of primordial inflation is catalyzing the 
maturation of quantum gravity from speculation into a hard science. I explain why 
quantum gravitational effects from primordial inflation are observable. I then review 
what has been done, both theoretically and observationally, and what the future holds. 
I also discuss what this tells us about quantum gravity. 
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1. Introduction 


Gravity was the first of the fundamental forces to impress its existence upon our 
ancestors, because it is universally attractive and long range. These same features 
ensure its precedence in cosmology. Gravity also couples to stress-energy, which is 
why quantum general relativity is not perturbatively renormalizable,! '° and why 
identifiable effects are unobservably weak at low energies.'4 These problems have 
hindered the study of quantum gravity until recently. This paper is about how 
interlocking developments in the theory and observation of inflationary cosmology 
have changed that situation, and what the future holds. 

The experiences of two Harvard graduate students serve to illustrate the sit- 
uation before inflation. The first is Leonard Parker who took his degree in 1967, 
based on his justly famous work quantifying particle production in an expanding 
universe.'®>!” Back then people believed that the expansion of the universe had 
been constantly slowing down or “decelerating.” Parker’s work was greeted with 
indifference on account of the small particle production associated with the current 
expansion, and on the inability of a decelerating universe to preserve memories of 
early-times when the expansion rate was much higher. The ruling dogma of the 
1960s was S-matrix theory, whose more extreme proponents believed they could 
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guess the strong interaction S-matrix based on a very few properties such as ana- 
lyticity and unitarity. Through a curious process this later morphed into string 
theory. Quantum field theory was regarded as a failed formalism whose success for 
quantum electrodynamics was an accident. 

Confirmation of the Standard Model had changed opinions about quantum field 
theory by my own time at Harvard (1977-1983). However, the perturbative non- 
renormalizability of quantum general relativity led to dismissive statements such as, 
“only old men should work on quantum gravity.” The formalism of quantum field 
theory had also become completely tied to asymptotic scattering experiments. For 
example, no one worried about correcting free vacuum because infinite time evolu- 
tion from “in” states to “out” states was supposed to do this automatically. Little 
attention was paid to making observations at finite times because the S-matrix 
was deemed the only valid observable, the knowledge of which completely defined 
a quantum field theory. My thesis on developing an invariant extension of local 
Green’s functions for quantum gravity was only accepted because Brandeis Profes- 
sor Stanley Deser vouched for it. I left it unpublished for eight years.1§ 

The situation was no better during the early stages of my career. As a postdoc, 
I worked with a very bright graduate student who dismissed the quantum gravity 
community as “la-la land” and made no secret of his plan to change fields. And 
there is no denying that any number of crank ideas were treated with perfect seri- 
ousness in those days, which validated our critics. I recall knowledgeable people 
questioning why anyone bothered trying to quantize gravity in view of the classical 
theory’s success. That opinion was never viable in view of the fact that the lowest 
divergences of quantum gravity* ® derive from the gravitational response to matter 
theories which are certainly quantum, whether or not gravitons exist.!+ The differ- 
ence between then and now is that I can point to data — and quite a lot of it — 
from the same gravitational response to quantum matter. 

Today cosmological particle production is recognized as the source of the primor- 
dial perturbations which seeded structure formation. There is a growing realization 
that these perturbations are quantum gravitational phenomena,!*!%-?° and that 
they cannot be described by any sort of S-matrix or by the use of in-out quantum 
field theory.?!:?? This poses a challenge for fundamental theory and an opportu- 
nity for its practitioners, which dismays some physicists and delights others. All 
of the problems that had to be solved for flat space scattering theory in the mid 
20th century are being re-examined, in particular, defining observables which are 
infrared (IR) finite, renormalizable (at least in the sense of low energy effective field 
theory) and in rough agreement with the way things are measured.??4 People are 
also thinking seriously about how to perturbatively correct the initial state.?° 

This revolutionary change of attitude did not result from any outbreak of sobri- 
ety within the quantum gravity community, or of toleration from our colleagues. 
The transformation was forced upon us by the overwhelming data in support of 
inflationary cosmology. In the coming sections of this paper I review the theory 
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behind that data, in particular: 


Why quantum gravitational effects from inflation are observable; 
Why the tree order power spectra are quantum gravitational effects; 


Other potentially observable effects; and 


e 
e 
e Loop corrections to the primordial power spectra; 
e 
e What the future holds. 


2. Why Quantum Gravitational Effects from 
Primordial Inflation are Observable 


Three things are responsible for the remarkable fact that quantum gravitational 
effects from the epoch of primordial inflation can be observed today: 


e The inflationary Hubble parameter is large enough that quantum gravitational 
effects are small, but not negligible; 

e Long wavelength gravitons and massless, minimally coupled scalars experience 
explosive particle production during inflation; and 

e The process of first horizon crossing results in long wavelength gravitons and 
massless, minimally coupled scalars becoming fossilized so that they can survive 
down to the current epoch. 


I will make the first point at the beginning, in the subsection on the inflationary 
background. Then the subsection on perturbations discusses the second and third 
points. 


2.1. The background geometry 


On scales larger than about 100Mpc the observable universe is approximately 
homogeneous, isotropic and spatially flat. The invariant element for such a geometry 
can be put in the form, 


ds? = —c7dt? + a?(t)dx - dx. (1) 

Two derivatives of the scale factor a(t) have great significance, the Hubble param- 
eter H(t) and the first slow roll parameter «(t), 
= a — 


Inflation is defined as H(t) > 0 with e(t) < 1. One can see that it is possible 
from the current values of the cosmological parameters (denoted by a subscript 
zero), 


Ho © 2.2 x 107'8 Hz, €9 & 0.47. (3) 
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However, the important phase of inflation for my purposes is Primordial Inflation, 
which is conjectured to have occurred during the first 1073? of existence. If the 
BICEP2 detection of primordial B-mode polarization is accepted, then we finally 
know the values of H(t) and €(t) near the end of primordial inflation,?’ 


H,18x 1018 Hz, 0.013. (4) 


I will comment later on the significance of H;. Let us here note that €; is very 
near the de Sitter limit of « = 0 at which the Hubble parameter becomes constant. 
This is a very common background to use when estimating quantum effects during 
primordial inflation. 

We have direct observational evidence that both the scale factor and its loga- 
rithmic time derivative H(t) have changed over many orders of magnitude during 
cosmic history. In contrast, the deceleration parameter only varies over the small 
range 0 < e(t) < 2. Figure 1 shows what we think we know about e(t) as a function 
of the number of e-foldings since the end of primordial inflation at t = te, 


a(t) } 


a(te) 


It is a tribute to decades of observational work that only a small portion of this figure 


N(t) =In | (5) 


is really unknown, corresponding to the phase of re-heating at the end of inflation. 


BBN 
2.0 ¢ ° 
eq 
Radiation rec 
154 Domination 
e(t) 
104 ? 
0.5 ¢ 
Primordial 
Inflation 
0.0 © - o ° o o ° o o ° o 
40 -30 -20 -10 0 10 20 30 40 50 60 
N 
Fig. 1. The red curve shows the first slow roll parameter ¢(t) = —H/H? as a function of the 


number of e-foldings N since the end of primordial inflation. A question mark stands for the phase 
of reheating between the epochs of primordial inflation and radiation domination, because there are 
many models for this period. Significant events marked on the graph are Big Bang Nucleosynthesis 
(BBN) when the seven lightest isotopes were produced, matter-radiation equality (eq) when the 
energy density was composed of equal amounts of relativistic and nonrelativistic matter, and 
recombination (rec) when neutral Hydrogen formed and the cosmic microwave radiation began 
free-streaming. Observable cosmological perturbations experience first horizon crossing near the 
lower left hand corner of the graph. 
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Primordial inflation was advanced in the late 1970s and early 1980s to explain 
the absence of observed relics (primordial black holes, magnetic monopoles, cosmic 
strings) and the initial conditions (homogeneous, isotropic and spatially flat) for the 
long epoch of radiation domination which is visible on Fig. 1. After some notable 
precursors,7° 3! the paper of Guth? focused attention on the advantages of a early 
epoch of inflation and, incidentally, coined the name. Important additional work 
concerned finding an acceptable way to commence inflation and to make it end.3*34 
The first completely successful model was Linde’s “Chaotic Inflation.” °° 

One of the most powerful motivations for primordial inflation is that it explains 
the Horizon Problem of why events far back in our past light-cone seem so uniform. 
I will review the argument here because the same analysis is useful for the next sub- 
section. From the cosmological geometry (1), we can easily compute the coordinate 
distance R(t2,t1) traversed by a light ray whose trajectory obeys ds? = 0, 


R(to, ty) = | AOR (6) 


Now note the relation, 


z am 7 : f al: (7) 


One can see from Fig. 1 that €(t) was nearly constant over long periods of cosmic 


evolution, in particular during the epoch of radiation domination, which would 
extend back to the beginning if it were not for primordial inflation. So we can drop 
the second term of (7) to conclude, 

c c 
(6, —l)Hya, (eg — 1) HeaQ’ 


R(te, ti) & (8) 


One additional exact relation brings the horizon problem to focus, 


<|H(t)a(t)] = —[elt) — Halt) (9) 
Combining Eq. (9) with (8) reveals a crucial distinction between inflation (€(t) < 1) 
and deceleration (€(t) > 1): During deceleration the radius of the light-cone is 
dominated by its upper limit, whereas the lower limit dominates during inflation. 
The horizon problem derives from assuming that there was no phase of primordial 
inflation so that the epoch of radiation domination extends back to the beginning 
of the universe. Suppose that the universe began at t = tg and we view some 
early event such as recombination (rec on Fig. 1) or big bang nucleosynthesis 
(BBN on Fig. 1). At time t = ¢; we can see things out to the radius of our past 
light-cone R(t;, to) which is vastly larger than the radius of the forward light-cone 
R(t2,t1) © c/[(e. — 1) Ha] that anything can have traveled from the beginning of 
time. For example, the cosmic microwave radiation is uniform to one part in 10°, 
which is far better thermal equilibrium than the air of the room in which you are 
sitting. Without a phase of primordial inflation we are seeing about 2200 different 
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patches of the sky which have not even had time to exchange a single photon, much 
less achieve a high degree of thermal equilibrium.'* Of course the problem just gets 
worse the further back we look. At the time of big bang nucleosynthesis BBN we 
are seeing about 10!° causally disconnected regions, which are nonetheless in rough 
thermal equilibrium.!4 

Without inflation the radius R(t2,t1) of the forward light-cone is almost inde- 
pendent of the beginning of time t2. No matter how early we make fg, it is not 
possible to increase R(t2,t1) more than about c/|(e, — 1)Hiai]. Hence, the high 
degree of uniformity we observe in the early universe would have to be a spectac- 
ularly unlikely accident. Primordial inflation solves the problem neatly by making 
the lower limit of the forward light-cone dominate, R(t2, t1) & c/[(1— €2)H2a2]. We 
can make the radius of the forward light-cone much larger than the radius of the 
past light-cone, so that causal processes would have had plenty of time to achieve 
the high degree of equilibrium that is observed. 

Before closing this subsection, I want to return to the numerical values quoted for 
Hp and H; in relations (3)—(4). The loop counting parameter of quantum gravity can 
be expressed in terms of the square of the Planck time, T?, = hG/c? © 2.9x 10787 s?. 
Quantum gravitational effects from a process whose characteristic frequency is w 
are typically of order w?T},. For inflationary particle production the characteristic 
frequency is of course the Hubble parameter, so we can easily compare the strengths 
of quantum gravitational effects during the current phase of inflation and from the 
epoch of primordial inflation, 


hGH 


= 1.4 x 107, 
Cc 


aoax 10-", (10) 


hGH? 
@ 


The minuscule first number is why we will never detect quantum gravitational 
effects from the current phase of inflation. Although the second number is tiny, it is 
not so small as to preclude detection, if only the signal can persist until the present 
day. In the next subsection, I will explain how that can happen. 

The loop counting parameter hGH?/c° is the quantum gravitational analog of 
the quantum electrodynamics fine structure constant a = e?/4meolic © 7.3 x 107%. 
Both parameters control the strength of perturbative corrections. Recall that a 
result in quantum electrodynamics — for example, the invariant amplitude of 
Compton scattering — typically consists of a lowest, tree order contribution of 
strength a, then each additional loop brings an extra factor of a. In the same way, 
the lowest, tree order quantum gravity effects from inflationary particle production 
have strength hG'H?/c°, and each addition loop brings an extra factor of hGH?/c?. 
Because the quantum gravitational loop counting parameter from primordial infla- 
tion is so much smaller than its quantum electrodynamics cousin, we expect that 
quantum gravitational perturbation theory should be wonderfully accurate. In fact, 
all that can be resolved with current data is the tree order effect, although I will 
argue in Sec. 4.4 that the one loop correction may eventually be resolved. Beyond 
that there is no hope. 
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2.2. Inflationary particle production 


The phenomenon of polarization in a medium is covered in undergraduate electro- 
dynamics. The medium contains a vast number of bound charges. The application 
of an electric field makes positive charges move with the field and the negative 
changes move opposite. That charge separation polarizes the medium and tends to 
reduce the electric field strength. 

One of the amazing predictions of quantum field theory is that virtual particles 
are continually emerging from the vacuum, existing for a brief period, and then 
disappearing. How long these virtual particles can exist is controlled by the energy— 
time uncertainty principle, which gives the minimum time At needed to resolve and 
energy difference AF, 


AtAE > hi (11) 


If one imagines the emergence of a pair of positive and negatively charged par- 
ticles of mass m and wave vector +k then the energy went from zero to E = 
2[m2ct + h2c2k?]2. To not resolve a violation of energy conservation, the energy— 
time uncertainty principle requires the pair to disappear after a time At given by, 


A 
At ~ —==————.. (12) 
Vmict + h2ctk? 
The rest is an exercise in classical (that is, nonquantum) physics. If we ignore the 
change in the particles’ momentum then their positions obey, 


2 +h?eE 
é (/m?c? + h?k? Axi) = +eE > Axs(At) = : . (13) 


edt? Qe(m2c2 + h2k?]8 


Hence the polarization induced by wave vector k is, 
he?E 


= +eAx, (At) — eAx_ (At) = ——————_—__... 
p +(At) () = Toa eel 


(14) 
The full vacuum polarization density comes from integrating d?k/(27)?. 

The simple analysis I have just sketched gives pretty nearly the prediction from 
one loop quantum electrodynamics, which is in quantitative agreement with exper- 
iment. It allows us to understand two features of vacuum polarization which would 
be otherwise obscure: 


e That the largest effect derives from the lightest charged particles because they 
have the longest persistence times At and therefore induce the greatest polariza- 
tion; and 

e That the electrodynamic interaction becomes stronger at short distances because 
the longest wavelength (hence smallest &) virtual particles could induce more 
polarization than is allowed by the travel time between two very close sources. 


Cosmological expansion can strengthen quantum effects because it causes the vir- 
tual particles which drive them to persist longer. This is easy to see from the 
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geometry (1). Because spatial translation invariance is unbroken, particles still have 
conserved wave numbers k. However, because the physical distance is the coordi- 
nate distance scaled by a(t), the physical energy of a particle with mass m and 
wave number k = 27/ becomes time-dependent, 


h2c2k? 
a?(t) 


Hence, the relation for the persistence time At of a virtual pair which emerges at 
time t changes from (12) to, 


E(t, k) = 4/mct + 


(15) 


t+At 
/ dt' E(t',k) ~ h. (16) 
t 


Massless particles persist the longest, just as they do in flat space. However, for 
inflation it is the lower limit of (16) which dominates, so that even taking At to 
infinity does not cause the integral to grow past a certain point. One can see this 
from the de Sitter limit, 


t+At 
hek heck 
! _ —H;At 
i dt alt’) — “a(t [l—e |. (17) 


A particle with ck < H(t)a(t) is said to be super-horizon, and we have just shown 
that any massless virtual particle which emerges from the vacuum with a super- 
horizon wave number during inflation will persist forever. 

It turns out that almost all massless particles possess a symmetry known as 
conformal invariance which suppresses the rate at which they emerge from the 
vacuum. This keeps the density of virtual particles small, even though any that do 
emerge can persist forever. One can see the problem by specializing the Lagrangian 
of a massless, conformally coupled scalar w(t, x) to the cosmological geometry (1), 


a EB Awdib aan 
a 


i R 
L= —sduhdog’ /—9 Thad g 


2 a? c? 
(18) 


The equation for a canonically normalized, spatial plane wave of the form w(t, x) = 
u(t, k)e**'* can be solved for a general scale factor a(t), 


ex } ef ae 
RL fh, alt) 
2ck a(t) 


(19) 


+ H 4 21] v=0= ult,k) = 


The factor of 1/a(t) in (19) suppresses the emergence rate, even though destructive 
interference from the phase die off, just as the energy—-time uncertainty principle 
(17) predicts. The stress-energy contributed by this field is, 


1 1 1 
Ty — bar ~ a OpWOs 6 Ln Iu kt tol D,D.| wv, (20) 
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where D,, is the covariant derivative and O is the covariant d’Alembertian. We 
can get the zero-point energy of a single wave vector k by specializing Tog to the 
cosmological geometry (1) and multiplying by a factor of a°(t), 
3 27.2 
a’ |. ck ee fed hek 

E(t,k) = 5 ier ( 72 ) |u|? + H(vo* + ov*)| = alt) (21) 
This is just the usual hw term which is not strengthened but rather weakened by 
the cosmological expansion. 


Only gravitons and massless, minimally coupled scalars are both massless and 
not conformally invariant so that they can engender significant quantum effects 
during inflation. Because they obey the same mode equation®®:?" it will suffice to 
specialize the scalar Lagrangian to the cosmological geometry (1), 

i 1 ole? 1 
L=—=0,60,¢9"" /-g > 50 E 2 avi] (22) 


2 Cc? 


The equation for a canonically normalized, spatial plane wave of the form ¢(t,x) = 
u(t, k)e"*’* is simpler than that of its conformally coupled cousin (19) but more 
difficult to solve, so I will specialize the solution to de Sitter, 


242 i f | ia() id [ier [ “t) 


c 
a eee th) = 

w+ 3Hu+ ae 0 => u(t, k) Suk 

(23) 

The minimally coupled mode function u(t,&) has the same phase factor as the 

conformal mode function (19), and they both fall off like 1/a(t) in the far sub- 

horizon regime of ck >> H,a(t). However, they disagree strongly in the super-horizon 


regime during which v(t, &) continues to fall off whereas u(t, ) approaches a phase 
times H;,/h/2c?k?. One can see from the equation on the left of (23) that u(t, k) 
_ heck 


approaches a constant for any inflating geometry. 
1 ck 1 H,a(t) \* 
E(t,k) = =a® } |u|? + ——-|ul?| = + | — 24 
ah) 9" G = a? fe a(t) | 2 2ck ae) 
Because each wave vector is an independent harmonic oscillator with mass propor- 


tional to a3(t) and frequency k/a(t) we can read off the occupation number from 
expression (24), 


The zero-point energy in wave vector k is, 


(25) 


N(t,k) = Ecle 


2ck 
As one might expect, this number is small in the sub-horizon regime. It becomes 
of order one at the time t, of horizon crossing, ck = H(t,)a(t,), and N(t,k) grows 
explosively afterward. This is crucial because it means that inflationary particle 
production is an IR effect. That means we can study it reliably using quantum 
general relativity, even though that theory is not perturbatively renormalizable. 
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The final point I wish to make is that the mode function u(t, &) becomes constant 
after first horizon crossing. For de Sitter this constant is calculable, 


ick 
exp| ihe 5 (26) 


However, one can see from the mode equation on the left-hand side of (23) that the 
approach to a constant happens for any inflating geometry. Recall from equation 
(9) that the inverse horizon length c~'H(t)a(t) grows during inflation and decreases 
during the later phase of deceleration which encompasses so much of the cosmolog- 
ical history depicted in Fig. 1. Hence, we can give the following rough summary of 
the “life cycle of a mode” of wave number k: 


h 
u(t, Blas > Hay! ar 


e At the onset of primordial inflation the mode has ck >> H(t)a(t) so the mode 
function oscillates and fall off like 1/a(t); 

e If inflation lasts long enough the mode will eventually experience first horizon 
crossing ck = H(t,)a(t,), after which mode function becomes approximately 
constant; and 

e During the phase of deceleration which follows primordial inflation, modes which 
experienced first horizon crossing near the end of inflation re-enter the horizon 
ck = H(T),)a(T;,), after which they begin participating in dynamical processes 
with amplitude larger by a factor of a(T;,)/a(t,) than they would have had with- 
out first horizon crossing. 


This is how quantum gravitational effects from the epoch of primordial inflation 
become fossilized so that they can be detected now. 


3. Tree Order Power Spectra 


Although the evidence for primordial inflation is overwhelming, there is not yet 
any compelling mechanism for causing it. The simplest class of successful models 
is based on general relativity plus a scalar inflaton y(x) whose potential V(w) is 


regarded as a free function,°° 
AR 1 
= | —— - - Vv ae V = 27 
or PnP v8 (¢)| V-8 (27) 


Here g,,,(x) is the D-dimensional, spacelike metric with Ricci curvature R. (I will 
work in D spacetime dimensions to facilitate the use of dimensional regularization, 
even though the D = 4 limit must eventually be taken for physical results). The 
purpose of this section is to show how this simple model can not only drive primor- 
dial inflation but also the quantum gravitational fluctuations whose imprint on the 
cosmic background radiation has been imaged with stunning accuracy.2°38 4° 

I first demonstrate that the potential V(y) can be chosen to support the cos- 
mological geometry (1) with any scale factor a(t) for which the Hubble parameter 
is monotonically decreasing. I also comment on the many problems of plausibility 
which seem to point to the need for a better model. I then decompose perturbations 
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about the background (1) into a scalar ¢(x) and a transverse-traceless tensor h,; (x). 
Owing to the particle production mechanism adumbrated in Sec. 2.2, certain modes 
of ¢ and hy; become hugely excited during primordial inflation, and then freeze-in 
so that they can survive to much later times. The strength of this effect is quantified 
by the primordial scalar and tensor power spectra, which I define and compute at 
tree order, along with associated observables. I then discuss the controversy which 
has arisen concerning an alternate definition of the tree order power spectra. The 
section closes with an explanation of why the tree order power spectra are the first 
quantum gravitational effects ever to have been resolved. 

I will adopt the notation employed in recent studies by Maldacena*! and by 
Weinberg,”! however, the original work for tensors was done in 1979 by Starobin- 
sky,4? and for scalars in 1981 by Mukhanov and Chibisov.*? Important subsequent 
work was done over the course of the next several years by Hawking,*4 Guth and 
Pi,*® Starobinsky,*° Bardeen et al.4” and by Mukhanov.*® Some classic review arti- 
cles on the subject are Refs. 49-51. 


3.1. The background for single-scalar inflation 


There is no question that a minimally coupled scalar potential model of the form 
(27) can support inflation because there is a constructive procedure for finding the 
potential V() given the expansion history a(t).°? °° For the geometry (1) the scalar 
depends just on time yo(t) and only two of Einstein’s equations are nontrivial, 


i 8G [ o 
3D-900-P = Bev], es) 
_ 1 8nG [oe 
(D- 2) - 5(D-2)(D- 1H? = = |B—vige)]. 2s) 
By adding (28) and (29) one obtains the relation, 
. 8rG , 
—(D-2)H = a #0: (30) 


Hence, one can reconstruct the scalar’s evolution provided the Hubble parameter 
is monotonically decreasing, and that relation can be inverted (numerically if need 
be) to solve for time as a function of 


yo(t) = yo(te) + | wy — Sen Ds T (0): (31) 


One then determines the potential by subtracting (29) from (28) and evaluating 
the resulting function of time at t = T(yo), 
(D — 2)? 
V = 
(90) = TeG 
Just because scalar potential models (27) can be adjusted to work does not mean 
they are particularly plausible. They suffer from six sorts of sometimes contradictory 


[H(T'(vo)) + 3H?(T(¥o))}.- (32) 
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fine-tuning problems: 


1) Initial Conditions — Inflation must begin with the inflaton approximately 
homogeneous, and potential-dominated, over more than a Hubble volume”®; 

2) Duration — The inflation potential must be flat enough to make inflation last 
for at least 50 e-foldings®**4; 

3) Scalar Perturbations — Getting the right magnitude for the scalar power spec- 
trum requires hG?/c!! x V3/V’? ~ 1071" (Ref. 49); 

4) Tensor Perturbations — Getting the right magnitude for the tensor power spec- 
trum requires c4/G x (V’/V)? ~ 1 (Ref. 49); 

5) Reheating — The inflation must couple to ordinary matter (its gravitational 
couplings do not suffice) so that its post-inflationary kinetic energy produces a 
hot, radiation-dominated universe’; 

(6) Cosmological Constant — The minimum of the scalar potential must have the 


right value AG?Vnin/c’ ~ 10-173 to contribute the minuscule vacuum energy 
58-61 


we observe today. 


Note that adding the matter couplings required to produce reheating puts 2-4 
at risk because matter loop effects induce Coleman—Weinberg corrections to the 
inflation effective potential. Nor does fundamental theory provide any explanation 
for why the cosmological constant is so small.°?°3 The degree of fine-tuning needed 
to enforce these six conditions strains credulity, and disturbs even those who devised 
the early models.®* 6” 

Opinions differ, but I feel it is a mistake to make too much of the defects of 
single-scalar inflation. The evidence for an early phase of accelerated expansion is 
overwhelming and really incontrovertible, irrespective of what caused it. Further, 
all that we know about low energy effective field theory confirms that the general 
relativity portion of Lagrangian (27) must be valid, even at the scales of primordial 
inflation. That suffices to establish the quantum gravitational character of primor- 
dial perturbations, even without a compelling model for what caused inflation. So 
I will go forward with the analysis on the basis of the single-scalar model (27), firm 
in the belief that whatever eventually supplants it must exhibit many of the same 
features. 


3.2. Gauge-fixed, constrained action 


We decompose g,,, into lapse, shift and spatial metric according to Arnowitt, Deser 
and Misner (ADM),°°-7° 


Swvde"dz’ = —N*c*dt? + gi;(dax* — N*cdt)(dx* — N*cdt). (33) 


ADM long ago showed that the Lagrangian has a very simple dependence upon the 
lapse," 


4 
£ = (Surface Terms) — “vi [w -A+ x (34) 
TT 
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The quantity A is a potential energy, 


167G 1 s 
A=-R+ 3 | vip) + jeuwsa', (35) 


where R is the (D — 1)-dimensional Ricci scalar formed from g;;. The quantity B 
in (34) is a sort of kinetic energy. 


. a 87rG (¢ AF 
Bah) BP By = (2 = ean) (36) 
c c 
where E;;/N is the extrinsic curvature, 
1 ae 
Bij = 5(Nig + Nia — ' 953] (37) 


and a semicolon denotes spatial covariant differentiation using the connection com- 
ADM fix the gauge by specifying N(t,x) and N*(t,x), however, Maldacena*! 
and Weinberg?! instead impose the conditions, 


Golt, x) = vt, x) = o(t) =0, (38) 
G;(t, x) = Oj hij (t, x) = 0. (39) 


The transverse-traceless graviton field hj; (t, x) is defined by decomposing the spatial 
metric into a conformal part and a unimodular part ij, 


gig = a (te OG, (t,x) > /g =aPtelP-US, (40) 


The unimodular part is obtained by exponentiating the transverse-traceless graviton 
field hij (t, x), 

Gij = [e" Ji = Oi t hij t shieh jp } O(h?), hg =0. (41) 
The Faddeev—Popov determinant associated with (38), (39) depends only on hjj;, 
and becomes singular for ¢ = 0. Recall that H(t) = a@/a and e(t) = —H/H?. 

Of course, no gauge can eliminate the physical scalar degree of freedom which 
is evident in (27). With condition (38) the inflation degree of freedom resides in 
¢(t, x) and linearized gravitons are carried by hi; (t,x). In this gauge the lapse and 
shift are constrained variables which mediate important interactions between the 
dynamical fields but contribute no independent degrees of freedom. Varying (34) 
with respect to N produces an algebraic equation for N, 


A-<5=05N= Te (42) 


al 


This gives the constrained Lagrangian a “virial” form, 
g grang 


Leonst. = (Surface Terms) — CVI AB, (43) 
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From relations (30-32) one can see that the background values of the potential 
and kinetic terms are equal, 4g = By = (D — 2)c~?[H + (D — 1)H?]. Hence, the 
background value of the lapse is unity. 

There is unfortunately no nonperturbative solution for the shift N*(t, x) in terms 
of ¢ and hij, so its constraint equation must be solved perturbatively. One first 
employs (40) to exhibit how the potential (35) depends on ¢ and hj;, 


A= Aj —-R=Ad(1 +a). (44) 


Here the spatial Ricci scalar is, 


R= Sik 2(D 2)V2¢ (O— 2 —2)e" 9" es), (45) 


where R = O(h?) is the Ricci scalar formed from Gij and V2 = ag 0; is the 
conformal scalar Laplacian. The full scalar Laplacian is, 


2 Loe aig. 
V? = ala] (46) 


At this stage one can recognize that the dimensionless three-curvature perturbation 
R is just ¢, in D = 4 dimensions and to linearized order,*” 


_ 1, (D-2 
R(t,x) = ie (77) 6G x) OC fh, BF): (47) 


The kinetic energy (36) can be expressed as, 


B = Ao(1+ 8) = Ao + 2(D — 2)e" 1 A[(D — 1)(e7 1 — C.N*) — N*,] — EM Ege 


Mp= 2 p=) (é cuit] D2) (: cat) hy a 


(48) 
Here we define Ni = N’, N; = Gi; NI and Ei; = LING. } Nj. otha]. 
The next step is to expand the volume part of the constrained Lagrangian in 
powers of a and 8, 


4 44D-1e(D-1¢ 

7 Vt AB = = ae Ap V/(1 + a)(1 + 8) 49) 
 ctaP-te(D=106 _(a+8) (a-8) , 
= 8nG Ao! “Ss 8 “} me 


As Weinberg noted, the terms involving no derivatives of ¢ or hij sum up to a total 
derivative,?! 


a Age SD =D)He? 4), (51) 
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Another important fact is that quadratic mixing between Ni and ¢ can be elimi- 
nated with the covariant field redefinition,” 
ce— 76 wid 
Ha? 


Sans Mos Cae cai), (52) 


After much work the quadratic Lagrangians emerge, 


@qP-} Sh Sh D—3 +e€ ak ae 
Loo = 32nG {ars OeS™ + (5) OeS” ORS \ (53) 
_ (D—2)ctea?—! C2 OnCOnG 
a 167G Ce a? ; en 
ca?! high; Op hig On hrig 
Lia = a 5 ze (55) 


These results suffice for the analysis of this section. To consider loop corrections 
(or non-Gaussiantity) one must solve the constraint equation for S*, 


D-3+¢e 


0505S" + (5) 0:0;57 = O(¢?, Ch, h?, SC). (56) 
= SE 
That is a tedious business which has only been carried out to a few orders. I will 


review what is known in Sec. 4.1. 


3.3. Tree order power spectra 


As we will see in Sec. 4.3, there is not yet general agreement on how to define the 
primordial power spectra when loop corrections are included.?*;*4 At tree order we 
can dispense with dimensional regularization, and also forget about the distinction 
between ¢ and the dimensionless three-curvature perturbation (47). The following 
definitions suffice: 


k3 

AR(k,t) = 5 f Bae ™™*(QIC(t, x)C(t, 0)|9), (57) 
3 

A? (k,t) = a dx e—** (Ohi, (t, x) hi; (t, 0)|Q). (58) 


Although it is useful to retain the time dependence in expressions (57), (58), the 
actual predictions of primordial inflation are obtained by evaluating the time- 
dependent power spectra safely between the first and second horizon crossing times 
t, and T;, described in Sec. 2.2, 


AR(k) =AR(kHltie<ecr,, Az(k) = ARK, tl cre: (59) 
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The state |Q) in expressions (57), (58) is annihilated by a(k) and ((k, A) in 
the free field expansions of ¢ and hj,;, 


3 
¢(t,x) = | are k)e™**a(k) + v*(t, k)e™**al (k)}, (60) 


3 
igi) = l= 2 {ult k)e™**e,5(k, A) B(k, A) + Conjugate}. (61) 


The polarization tensors €;;(k, A) are the same as those of flat space. If we adopt 
the usual normalizations for the creation and annihilation operators, 


[a(k), af (k’)] = (21)?5°(k—k’),  [B(k, A), al (k’, A’)] = day(2m)°57°(k —k’) (62) 


then canonical quantization of the free Lagrangians (54), (55) implies that the mode 
functions obey, 


€ ek? e ,  tAthG 
v + si { | U ae Qi 0, vu — vu = eae ’ (63) 
? a ee ye 1820hG 
a+ 3Hu+ Gz =0, u* -—wu* = qs (64) 


It has long been known that the graviton mode function u(t,k) obeys the same 
Eq. (23) as that of a massless, minimally coupled scalar.*°3" Only their normaliza- 
tions differ by the square root of 327G'/c?. 

By substituting the free field expansions (60), (61) into the definitions (57), (58) 
of the power spectra, and then making use of the canonical commutation relations 
(62), one can express the tree order power spectra in terms of scalar and tensor 
mode functions, 


k#|u(t, k)| 

Aik) = 

Rk, ) O72 ) (65) 
k3 u(t, k)|? 4 k3|u(k, t)|? 

A? (k,t) = fe BIE (t >, ea; = cal v JF (66) 


A=+ 


One of the frustrating things about primordial inflation is that we do not know what 
a(t) is, so we need results which are valid for any reasonable expansion history. This 
means that even tree order expressions such as (65), (66) can only be evaluated 
approximately because there are no simple expressions for the mode functions for 
a general scale factor a(t).”2-™4 

One common approximation is setting «(¢) to a constant, the reliability of which 
can be gauged by studying the region (at N ~ —40) of Fig. 1 at which currently 
observable perturbations freeze-in. (The necessity of nonconstant e€(t) later is not 
relevant for the validity of assuming constant ¢(t) to estimate the amplitude at 
freeze-in.) For constant ¢«(t) both mode functions are proportional to a Hankel 
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function of the first kind, 


7 7 87?hG (a) ck eee (ie 
(t, k) Va — A) eH (tar(t)” € = one , 2 ( - :). on 


Between first and second horizon crossing (t, < t < T;,) we can take the small 
argument limit of the Hankel function, 


e(t) = 0, th«t<T; 


8n7hG —il(v) [2(1— 6) H(t)a(t)]” 
woe) = fp He) [20 = Htal)” 6 


/167?hG  —iT(v) [2(1 — €)A(t)a‘(t) Tm 
= oe (ck)é | , ae 


Constant ¢(t) also implies H(t)a‘(t) is a constant, which we may as well evaluate 
at the time of first horizon crossing, H(t)a‘(t) = H!~‘(t,)(ck)*. With the doubling 
formula (I'(2x) = 2?°-1/./m x T'(a)P'(a + $)) we at length obtain, 


1 
: i ele= 2G) 16nhGH? (ty) 
et)=0, th <Xt<T, => ult,k) & i| c | (1) x BES : 


(71) 


The factor multiplying the square root has nearly unit modulus for small « — and 
the BICEP2 result is ¢; = 0.013,2” while previous data sets give the even smaller 
bound of e; < 0.007 at 95% confidence.** *° Hence it should be reliable to drop this 
factor, resulting in the approximate forms, 


hGH? (tp) 16hGH? (ty) 
2 es BIE cs 
BRE) ae) 2 BR 5 (72) 


The Wentzel-Kramers—Brillouin (WKB) approximation is another common 
technique for estimating the freeze-in amplitudes of u(t, k) and u(t, k) which appear 
in expressions (65), (66) for the tree order power spectra. Recall that the method 
applies to differential equations of the form f + w(t) f = 0. From expression (64) 
one can see that reaching this form for the tensor mode function requires, the rescal- 
ing f(t, k) = a2(t) x u(t,k). It is then simple to recognize the correctly normalized 
WKB solution and its associated frequency as, 


167hG WT ak ad 
u(t,k) & Fai(f) w(t, b)) exp] if dt'w(t 1), (73) 
2 _ eh 9 3 2 
w(t,k) = A E 5e() A(t). (74) 
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In the sub-horizon regime of ck >> H(t)a(t) the frequency is real w(t, k) = ck/a(t) 
and the solution (73) both oscillates and fall off like 1/a(t). Freeze-in occurs in the 
super-horizon regime of ck < H(t)a(t) during which the frequency is imaginary 
w(t,k) © iH(t)[3 — 4e(t)]. We can estimate the freeze-in amplitude by computing 
the real part of the exponent, 


nym fg) 84@) LHe) _ | Ft OHFO 
i f aren = fa iio | AO a - ae al (75) 


Substituting (75) in (73) and using H(t,)a(t,) = ck gives the following result for 
the freeze-in modulus, 


167hG H? (ty, 
th Kt <T = |u(t, k) we © OGG) (76) 
3. 6C(d 
ks 3 = 5) 


If we drop the order one factor of |2 — $e(t)| the result is (72). 

From expression (63) we see that reaching the WKB form for the scalar mode 
function requires the rescaling f(t,k) = €2(t) x a2(t) x v(t,k). Its frequency is 
simpler to express if I first introduce the (Hubble form of the) second slow roll 
parameter 7(t), 

= e(t) 
nl) = €) ~ say (77) 
The correctly normalized WKB approximation for the scalar mode function and its 
frequency is, 


27hG . ‘ / i: 
nel | =asaRen oni fate] — 


242 3 3 é-7 
‘Cha t e(t) — n(t t)) 4 H(t). (79 
Pbk) = Sa -|(F +e m0) (F-n0) + Ft] rw. (79) 
Freeze-in occurs as one evolves from the sub-horizon regime of v(t, k) = ck/a(t) to 
the super-horizon regime of v(t, k) + iH(t)[$ + 5e(t) — n(t)]. The real part of the 
exponent of (78) is, 


3a(t!) 1H(t’) . 1&(t’) 
—4 ‘U(t!,k) © 
ff ree [ si tee af) * 2H(#) * De(#) 
3 1 1 
a2(t)H2 (t)e2(t 
a2 (ty, )H2 (ty, )e2 (th) 
Substituting in (78) gives a freeze-in modulus which is again roughly consistent 
with (72), 


QrhG H? (tx) 
Ok (th) 


th <t< Th = |v(t,k)|wxs © 


(81) 
2 


5+ 5el0) — 0) 
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It should be obvious from the discordant factors of order one in expressions 
(71), (76) and (81), that the results (72) for the tree order power spectra are only 
approximate. This is confirmed by numerical integration of explicit models.”'”° In 
addition to order one factors, depending on the geometry at t = t, there is also a 
nonlocal “memory factor” which depends on the precise manner in which the mode 
evolves up to first horizon crossing.“ Most of the ambiguity derives from not having 
a definitive model for what caused inflation. Once the expansion history is known 
it is possible to derive wonderfully accurate results by numerically integrating the 
mode functions.” It is even more efficient to numerically evolve the time-dependent 
power spectra directly, without the irrelevant phase information.” 

The tree order power spectra (59) give the tensor-to-scalar ratio r(k), the scalar 
spectral index n,(k), and the tensor spectral index n;(k), 


_ ARR) 
nh) Si ete ae 4 onlb,) (83) 


ne(k) = a) mw —2e(ty). 


In each case the definition is exact, and the approximate result derives from expres- 


(84) 


sions (72) with an additional approximation to relate dk to dt,, 
cdk = [1 — €(t,)|H?(ty)a(ty)dt, © ck x H(ty)dty. (85) 


Comparison of (82) and (84) implies an important test on single-scalar inflation 


which is violated in more general models,” *! 


re —8ne. (86) 
Certain general trends are also evident from the approximate results (72): 


e r <1 because € <1; 
e n: <0 because H(t) decreases; and 
en; —1< mm because e(t) tends to increase. 


To anyone who works in quantum gravity it is breath-taking that we have any data, 
so it seems petulant to complain that the some of the parameters, and particularly 
their dependences upon k, are still poorly constrained. The scalar power spectrum 
can be inferred from the measurements of the intensity and the E-mode of polariza- 
tion in cosmic microwave radiation which originates at the time of recombination 
(rec on Fig. 1) and then propagates through the fossilized metric perturbations left 
over from the epoch of primordial inflation. The latest full-sky results come from 
the Planck satellite and are fit to the form,”® 


i ns—l+ gas In( 5) 
) (87) 


A? (k) & As (= 
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The fiducial wave number is ko = 0.050 Mpc”', and the quantities “n,” and 
“dn;/d\n(k)” are the scalar spectral index and its derivative evaluated at k = ko. 
When combined with the polarization data from the WMAP satellite®® the Planck 
team reports,?° 


10? x A, = 2196" Steg a = 0.0603 + 0.0078, 


dns (88) 
— = —0.013+ 0.018. 
din(h) 0.013 + 0.018 


The tensor-to-scalar ratio is reported at a much smaller wave number of k = 


0.002 Mpc~'. Bounds on 19,992 can be derived from analyzing how the intensity and 
the E-mode of polarization of the cosmic microwave background radiation depend 
upon k. Because this sort of & dependence might also indicate “running” of the 
scalar spectral index (dn,/dln(k) 4 0), the limits on 19.992 become significantly 
weaker if one allows for running. Combining Planck and previous data sets gives 


the following bounds at 95% confidence,?® 
dns 
mal = 
ro.oo2 < 0.11, ine 0, (89) 
<0.26, 40 (90) 
df fs - 
0.002 ’ dIn(k) 


Direct detection requires a measurement of the B-mode of polarization. The 
BICEP2 team have done this and they report a result consistent with, 


T0.002 ~*~ 0.20. (91) 


Although the tensor power spectrum is still poorly known and controversial,8?/53 


resolving it (and ruling out alternative interpretations**) is terrifically important 
because it tests single-scalar inflation through relation (86) and incidentally fixes 
the scale of primordial inflation. If only AZ (k) is resolved one can always construct 
a single-scalar potential V(y~) which will explain it. To see this, suppose we have 
measured the scalar power spectrum for some range of wave numbers kz < k < ky 
and use the approximate formula (72), along with the small € relation (85) between 
dt, and dk, to reconstruct the inflationary Hubble parameter, 


1 1 2hG [ dk! 
HG) HE ae? Jy WAAR). 


(92) 


The Hubble parameter Hz = H(t2) is an integration constant which we can choose 
to make the tensor power spectrum smaller than any bound. Now use (92), with 
(85), to reconstruct the relation between t and k, 


k / 2 k/ 
dk 2hGH. dk" 
Ho(t — tz) = 14 2 ; 
a(t — ta) ij. a mee a ie? 2 Ce") 3) 


This expression can always be inverted numerically, and the rest of the construction 
is the same as that given in Sec. 3.1. 
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3.4. The controversy over adiabatic regularization 


It is obvious from their free Lagrangians (60)—(55) that the two-point correlators of 
¢(t,x) and h,;(t,x) diverge quadratically when the two fields are evaluated at the 
same spacetime point. This is not enough to induce any tree order divergence in my 
definitions (57), (58). However, it is problematic for the more common definition 
which is based on a spectral resolution of the coincident two-point function, 


(alc(txyett.x)I0) = fF veces, (94) 
(Ml has(Erha(t, x90) = [FE azine) (95) 


In 2007 Parker,®° pointed out that removing this divergence with the standard 
technique of adiabatic regularization!®8° 69 
orders of magnitude. 


can change the power spectra by several 


Subsequent work by Parker and his collaborators showed that adiabatic regu- 
larization of the scalar and tensor power spectra would alter the single-scalar con- 
sistency relation (86) and would also reconcile the conflict between even WMAP 
data®® and a quartic inflation potential V(y) = Ay*.9°! Such profound changes 
in the labor of three decades provoked the natural objection that no technique for 
addressing ultraviolet divergences ought to affect the IR regime in which inflation- 
ary particle production takes place.°?:°? Parker and his collaborators replied that 
consistency of renormalization theory requires adiabatic subtractions which affect 
all modes, including those in the IR.%* 

I find this debate fascinating because it is an example of how inflationary cos- 
mology is challenging the way we think about, hitherto abstract issues in quantum 
gravity and viceversa. I do not know the answer, but I have encountered the same 
problem when trying to work out the pulse of gravitons which would be produced by 
avery peculiar model in which H(t) oscillates from positive to negative at the end of 
inflation.°?.°* The resolution may not lie with any change in the way we renormalize 
but rather with greater care in how we connect theory to observation, for example, 
defining the tree-order power spectra from expressions (57), (58) rather than from 
spectral resolutions of the coincident two-point functions (94), (95).°” Whatever we 
find, it is worthwhile to reflect on the wonder of what is taking place. These are the 
same problems which the men of genius who founded flat space quantum field the- 
ory had to puzzle out when they settled on noncoincident one-particle-irreducible 
(1PI) functions as the basis for renormalization and computation of the S-matrix. 
It is a privilege to reprise their roles. 


3.5. Why these are quantum gravitational effects 


The factors of hG' in expressions (72) ought to establish that both power spectra 
are legitimate quantum gravitational, the first ever detected. Unfortunately, three 
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objections seem to be delaying general recognition of this simple but revolutionary 
fact: 


e Expressions (72) are tree order results; 
e There is still debate over whether or not the graviton signal A?(k) has been 
resolved?”82:83. and 


e There is not yet a compelling model for what caused primordial inflation. 


I will argue below that all three objections result from imposing unreasonably high 
standards on what qualifies as a quantum gravitational effect. 

The first objection might be re-stated as, “it’s not quantum gravity if it’s only 
tree order.” This is applied to no other force. For example, both the photo-electric 
effect and beta decay occur at tree order, yet no one disputes that they are quantum 
manifestations of the electroweak interaction. The same thing could be said of 
Planck’s blackbody spectrum, and any number of other tree order effects such as 
Bhabha scattering. 

The second objection might be restated as, “it’s not quantum gravity if it doesn’t 
involve gravitons.” This is also silly because AZ,(k) has certainly been resolved 
and it is just as certainly a quantum gravitational effect in view of the factor of 
AG evident in expression (72). We saw in Secs. 3.2 and 3.3 that the scalar power 
spectrum derives from the gravitational response to quantum matter, the same 
way that all the solar system tests of general relativity derive from the gravitational 
response to classical matter. Were we to insist that only gravitons can test quantum 
gravity then logical consistency would imply that only gravitational radiation tests 
classical gravity, at which point we are left with only the binary pulsar data! 

Indeed, a little reflection on the problem of perturbative quantum gravity!4 
reveals that the lowest order problem is not from gravitons — which cause no 
uncontrollable divergences until two loop order!?:!° — but rather from exactly the 
same gravitational response to quantum matter which the scalar power spectrum 
tests. All experimentally confirmed matter theories engender quantum gravitational 
divergences at just one loop order.* ® If a sensible quantum gravity expert was told 
he could only know one of the two power spectra and then asked to choose which 
one, he ought to pick AGE) because it tells him about the lowest order problem. 
Fortunately, we will know both power spectra, and probably sooner rather than 
later. It is even possible we will eventually resolve one loop corrections. 

The final objection could be re-stated as, “it isn’t quantum gravity if we can’t 
make a unique prediction for it.” This seems as ridiculous as trying to argue that 
galactic rotation curves don’t necessarily derive from gravity just because we are 
not yet certain whether their shapes are explained by Newtonian gravity with dark 
matter or by some modification of gravity. Which is not to deny how wonderfully 
improved the situation would be with a compelling model for inflation. If we had 
one then the two power spectra would provide a definitive test of quantum gravity, 
the same way that the photo-electric effect and Bhabha scattering test quantum 
electrodynamics. 
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Sceptics are free to accuse me of unwarranted optimism, but I believe that 
working out what drove primordial inflation is just a matter of time in the data- 
rich environment which is developing. Measurements of n, with increasingly tight 
upper bounds on r9,992 have already ruled out some potentials such as V(y) ~ or 2 
and all models with constant €(t).394° This process is bound to continue, and even 
accelerate, as the data gets better. There are plans to reduce the errors on n, by 
a factor of five using galaxy surveys.°® (This will begin filling in the question mark 
region of reheating on Fig. 1.) If the BICEP2 detection really means 19.992 ¥ 0.20, 
it would rule out a host of models with small r9.992.2” We will know within the next 
five years by checking if the BICEP2 signal possesses the key frequency and angular 
dependences needed to distinguish it as primordial gravitons. If so then it should be 
possible to reduce the errors on 1’p,992 to the percent level within the next decade. As 
higher resolution polarization measurements are made over the course of the next 
15 years, it should be possible to remove the gravitational lensing signal (known 
as “de-lensing”) to reach the sensitivity needed to measure n;. It is inconceivable 
to me that theorists will remain idle while these events transpire. Past experience 
shows that theory and experiment develop synergistically. The data are not going 
to run out any time soon, and I believe fundamental theorists will eventually receive 
enough guidance to develop a truly compelling model for primordial inflation. 

Let me close this section by pointing out that just the fact of observing scalar 
perturbations from primordial inflation tells us two significant things about quan- 
tum gravity": 


e It is no longer viable to avoid quantizing gravity; and 
e The problem of ultraviolet divergences cannot be explained by making spacetime 
discrete. 


The first point is obvious from the fact that the scalar power spectrum represents the 
gravitational response to quantum fluctuations of matter, which would be absent if 
the source of classical gravity were taken to be the expectation value of the matter 
stress tensor in some state. To see the second point, note that although discretiza- 
tion at any scale makes quantum gravitational loop integrals finite, it will not keep 
them small unless the discretization length is larger than \/hG/c3 ~ 1.6 x 10734 m. 
But primordial inflation posits that the universe has expanded by the staggering 
100 —~ 1043 from a time when quantum gravitational effects were 
small. Hence, the current comoving scale of discretization would correspond to 
about a million kilometers! 


factor of about e 


4. Loop Corrections to the Power Spectra 


From (88) one can see that the scalar power spectrum is currently measured with 
an accuracy of more than two significant figures. However, resolving the one loop 
correction would require about 10 significant figures because the loop counting 
parameter of inflationary quantum gravity is no larger than hGH?/c? ~ 107'°. 


II-372 R. P. Woodard 


Although there is no hope of achieving this precision within the next two decades, 
the data is potentially recoverable and theorists have begun thinking about how 
to predict the results when (and if) one loop corrections are resolved in the far 
future. This section describes the basic formalism and the significant issues. I close 
by adumbrating a process through which the missing eight significant figures might 
be made up. 


4.1. How to make computations 


I will return later in this section to the issue of precisely what theoretical quantities 
correspond to the observed scalar and tensor power spectra. For now let me assume 
that the tree order definitions (57), (58) remain valid. One striking fact about these 
expressions is that neither of them is an S-matrix element. Nor is either the matrix 
element of some product of noncoincident local operators (because both are at the 
same time) between an “in” state which is free vacuum at asymptotically early-times 
and an “out” state which is free vacuum at asymptotically late-times. One can define 
a formal S-matrix for the simplest cosmologies®® but it calls for measurements which 
are precluded by causality. More generally, the entire formalism of in-out matrix 
elements — which is all most of us were taught to calculate — is inappropriate 


100 and no one 


for cosmology because the universe began with an initial singularity 
knows how it will end. Persisting with in-out quantum field theory would make 


loop corrections possess two highly undesirable features: 


e They would be dominated by assumptions about the “out” vacuum owing to vast 
expansion of spacetime; and 

e The matrix elements of even Hermitian operators would be complex num- 
bers because the “in” and “out” vacua must differ due to inflationary particle 
production. 


The more appropriate quantity to study in cosmology is the expectation value of 
some operator in the presence of a prepared state which is released at a finite time. 
Of course one could always employ the canonical formalism to make such computa- 
tions, but particle physicists yearn for a technique that is as simple as the Feynman 
rules are for in-out matrix elements. Schwinger devised such a formalism for quan- 
tum mechanics in 1961.!°! Over the next two years, it was generalized to quantum 
field by Mahanthappa!°? and by Bakshi and Mahanthappa.!°?!04 Keldysh applied 
it to statistical field theory in 1964 (Ref. 105) where the technique has become rou- 
tine. Until very recently, its use in quantum field theory was limited to a handful 
of people working on phase transitions and gravity.!°° 19 Most particle theorists 
were majestically ignorant of the technique and so attached to the in-out formal- 
ism that they dismissed as mistakes what are significant and deliberate deviations 
of the Schwinger—Keldysh formalism, such as the absence of an imaginary part. 
The stifling atmosphere which prevailed is well-conveyed by the lofty disdain in the 
words of a referee I had for a 2003 grant renewal proposal to the Department of 
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Energy: 


In his work with Tsamis, Woodard has focused on what they interpret as an 
instability of de Sitter space due to a two loop IR divergence associated with 
long wavelength virtual gravitons. They describe this as the accumulation 
of gravitational attraction of “large-wavelength virtual gravitons.” That is 
a puzzling statement in itself — the accumulation that they describe would 
build up only if gravitons were really being produced. In fact, they think 
these virtual gravitons are rendered real as they are “pulled apart by rapid 
expansion of spacetime.” I believe that there is absolutely no evidence for 
this. Real particle production should show up as an imaginary contribution 
to the graviton vacuum polarization tensor, at least if unitarity in de Sitter 
space resembles flat space. 


The thinking of particle theorists underwent a radical transformation in 2005 
when Nobel laureate Weinberg undertook a study of loop corrections to the power 
spectra.2! He quickly realized that the in-out formalism was inappropriate and, 
because he did not then know of the Schwinger—Keldysh formalism, he indepen- 
dently discovered a version of it which is better suited to this problem than the 
usual one. (His student Bua Chaicherdsakul told Weinberg of the older technique, 
and he gave full credit to Schwinger in his paper). I well recall the day Weinberg’s 
paper appeared on the arXiv. I chanced to be visiting the University of Utrecht then 
and a very knowledgeable and not unsympathetic colleague commented, “I guess 
I will finally have to learn the Schwinger—Keldysh formalism.” Weinberg’s words 
on the general problem of computing loop effects in primordial inflation are also 
worth quoting in defence of the intellectual curiosity which is sometimes lacking in 
particle theory: 


This paper will discuss how calculations of cosmological correlations can be 
carried to arbitrary orders of perturbation theory, including the quantum 
effects represented by loop graphs. So far, loop corrections to correlation 
functions appear to be much too small ever to be observed. The present 
work is motivated by the opinion that we ought to understand what our 
theories entail, even where in practice its predictions cannot be verified 
experimentally, just as field theorists in the 1940s and 1950s took pains to 
understand quantum electrodynamics to all orders of perturbation theory, 
even though it was only possible to verify results in the first few orders. 


The best way to understand the Schwinger—Keldysh formalism is by relating 
its functional integral representation to the canonical formalism. Recall how this 
goes for the in-out formalism in the context of a real scalar field $(t,x) whose 
Lagrangian is the spatial integral of its Lagrangian density, 


L{o(é)] = 7 dx £[p(t,)]. (96) 
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The in-out formalism gives matrix elements of T*-ordered products of operators, 
which means that any derivatives are taken outside the time-ordering symbol. The 
usual relation is adapted to asymptotic scattering problems but, for our purposes, it 
is better to consider the matrix element between a state |W) whose wave functional 
at time t = tg is U[(t2)| and a state (®| with wave functional ®/d(t,)|. The 
well-known functional integral expression for the matrix element of the T’*-ordered 
product of some operator O,[d] is, 


(®|T*(Oa[9])|¥) = |i Oald] [ots e* Sz HH] WL (ty)]. (97) 


We can use (97) to obtain a similar expression for the matrix element of the anti- 
T*-ordered product of some operator O,|¢] in the presence of the conjugate states, 


(v|T" (Os[9))|®) = (®|T*(O}[6])|W)* (98) 
7 | [44] Os[¢] [p(t J eW* Sez #2] we [s(ty)]. (99) 


Summing over a complete set of wave functionals ®[¢(t1)] gives a delta functional, 
d- Plo—(t1)] ©" [b+ (t)] = 616-() — 44(4))- (100) 
b 


Multiplying (97) by (99), and using (100), gives a functional integral expression 
for the expectation value of any anti-T*-ordered operator O, multiplied by any 
T*-ordered operator Og, 


(¥[T" (Op[d))T* (Calg) 1¥) 


= | eon ]6[¢-(4) — 644) 


x Op[d-JOald4]¥*[d- (toler Fe MHF +Ol-L16- Ol WI, (ta)]. (101) 


This is the fundamental Schwinger—Keldysh relation between the canonical operator 
formalism and the functional integral formalism. 

What we might term the “Feynman rules” of the Schwinger—Keldysh formalism 
follow from (101) in close analogy to those for in-out matrix elements. Because 
the same field operator is represented by two different dummy fields, ¢4(x), the 
endpoints of lines carry a + polarity. External lines associated with the operator 
O,|¢] have the — polarity while those associated with the operator O,[¢] have 
the + polarity. Interaction vertices are either all + or all —. The same is true for 
counterterms, which means that mixed-polarity diagrams cannot harbor primitive 


divergences. Vertices with + polarity are the same as in the usual Feynman rules 
whereas vertices with the — polarity have an additional minus sign. Propagators 
can be ++, —+, +— and 
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The four propagators can be read off from the fundamental relation (101) when 
the free Lagrangian is substituted for the full one. I denote canonical expectation 
values in the free theory with a subscript 0. With this convention one sees that the 
++ propagator is the ordinary Feynman result, 


iA, (@;2") = (QIT(4(e) 6(2’))|M)o = tA(@;2"). 102) 
The other cases are simple to read off and to relate to (102), 
iA_,(a;2") = (Q\d(x)9(2')|Q)0 = A — t)iA(a; 2’) + Ot — t)[iA(a;2’)]*, (103) 
iA, (a; 2") = Q\e(x’)o(@)|Q)o = O(t — #)[PA(a;a')]* + Ot — tiA(@;2'), (104) 
iA__(x; 2") = (QT ((x)9(2"))|Q)o = [FA(a; 2')]*. 105) 


The close relations between the various propagators and the minus signs from — 
vertices combine to enforce causality and reality in the Schwinger—Keldysh formal- 
ism. For example, in a diagram with the topology depicted in Fig. 2, suppose the 
vertex at x is connected to an amputated external + line. If the vertex at a’” is 
internal then we must sum over + and — variations and integrate to give a result 
proportional to, 


[Pe'id..2)) - [iA (a;2")}} =2i f act! { d?-Ye' im([i(¢5 2") 
(106) 


Although expression (97) is simple to derive from the canonical formalism, few 
particle theorists would have recognized it before Weinberg’s paper for two reasons: 


e The action integral runs between the finite times tg < t < t); and 
e It contains state wave functionals U[¢(t2)] and ®*[(t1)]. 


The over-specialization of quantum field theory to asymptotic scattering problems 
led to generations of particle theorists being inculcated with the dogma that it is 
irrelevant to consider any state but “the” vacuum (often defined as “the unique, 
normalizable energy eigenstate”), and that this state is automatically selected by 


ah 


/ 
v L 
Fig. 2. A typical Schwinger—Keldysh loop. The vertex at x connects to an amputated + leg. 
The vertex at x2/” is internal and must be summed over + and — polarities. The cancellation 
between polarities makes the integral (106) pure imaginary and restricts the integration to points 
z’” on or within the past light-cone of «x. 
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extending the temporal integration to —oo < t < +oo. This was always nonsense, 
but it sufficed for asymptotic scattering theory as long as IR problems were treated 
using the Bloch—Nordsieck technique,'!° the universal applicability of which also 
became dogma in spite of simple counterexamples.!!! 

Inflationary cosmology has forced us to consider releasing the universe in a 
prepared state at a finite time. When this is done one realizes that the state wave 
functional W[¢(t2)| can be broken up into a free part, whose logarithm is quadratic 
in the perturbation field, and a series of perturbative corrections involving higher 


powers of the field, 
V[O(t2)] = Volo(ta)|{1 + O(¢° (t2))}- (107) 


For example, the free vacuum state wave functional of a massive scalar in flat 
space is, 


mc 
ie 


Qo[P(t2)] « exp - 3 fos (te, x)4/—V? + i) - (108) 
It can be shown that the free part of the vacuum wave functional combines with 
the quadratic surface variations of the action to enforce Feynman boundary con- 
ditions.‘!? Rather than the usual hand-waving, that is how inverting the kinetic 
operator gives a unique solution for the propagator. The perturbative correction 
terms (107), which must be present even to recover the flat space limit, correspond 
to nonlocal interactions on the initial value surface.?° 


4.2. e-Suppression and late-time growth 


Making exact computations requires the ¢(t,x) and h,j;(t,x) propagators and 
their interaction vertices. From the free Lagrangian (54), and the appropriate 
D-dimensional generalization of the scalar mode function (63), one can give a formal 
expression for the ¢ Feynman propagator, 
3 ! ak ! * (yl / * / ik-x 
iA¢(a; 2°) = (amyb=1 tt —t v(t, k)u* (tk) + O(t — thu*(t, ku’, k) fe. 
T 
(109) 


Expressions (55) and (64) give a similar result for the graviton Feynman propagator, 
, 1 emia 
liz Ane](2; 2") = ath = 5 apttl| x i) (nP=1 
x {0(t — t')u(t, k)u*(t’, k) + O(t! — t)u*(t, k)u(t',k)}e"*™*, (110) 


where the transverse projection operator is II;; = 6;; — 0:0; /0,0,. Unfortunately, 
we do not possess simple expressions for either the scalar or tensor mode functions 
for a general expansion history a(t), nor are all the gauge-fixed and constrained 
interactions yet known to the order required for a full one loop computation, and 
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nothing has been done about renormalization. I will therefore concentrate on char- 
acterizing how loop corrections behave with respect to the two most important 
issues which control their strength: 


e Enhancement by inverse factors of the slow roll parameter e(t); and 
e Enhancement by secular growth from IR effects. 


To understand the issue of e-enhancement, let us first note from the free Lagrangians 
(54), (55) that the scalar and tensor propagators have the following dependences 
upon ¢(t) and the various fundamental constants, 


hG AG 
iA¢ ~ —— x Frequency”, iA; ~ — & Frequency”. (111) 
Ce c 


For the effects of inflationary particle production the relevant frequency is the Hub- 
ble parameter H(t). (Of course it could be at any time in the past, as could the 
factor of 1/e(t) in the ¢ propagator). This offers a very simple explanation for 
the approximate forms (72) I derived for the tree order power spectra in Sec. 3.3. 
The relevant diagrams are given in Fig. 3. 

To find the gauge-fixed and constrained interactions one must solve the con- 
straint equation (56) for S*[¢,h], then substitute back into (49). There are many 
terms, even at the lowest orders, and they generally combine (sometimes after par- 
tial temporal integrations) so that the final result is suppressed by crucial powers 
of «. Each term has two net derivatives, however, this counting must include —1 
derivatives from factors of 1/H, and —2 derivatives from factors of 1/0,0, which 
arise in solving the constraint equation (56). The ¢? interaction was derived in 2002 
and Maldacena,*! and simple results were obtained in 2006 for the ¢4 terms by Seery 
et al.'!8 At the level of detail I require these two interactions take the form, 


1 ea? 1 tea?! 2 
~ ~ » LP 
BS ~ Tenng 06% 5 hct~ Terag § 966 i) 
In 2007 Jarhus and Sloth discussed the next two interactions,!!4 
1 ear, cba 
ee, ~ ——_—_. : 11 
RES ~ jenna § O60 Lee ~ eRe $0606 ao 


Results for the lowest ¢-h;; interactions were reported in 2012 by Xue et al. 


Making no distinction between which fields are differentiated, these interactions 
take the general form, 


cteaP-1 


Tenng [Sonor + hOGA + h?0CdC]. (114) 


il 
7 Eon? + Len + Le2n2] ~ 


oe cc” ~~ +#@ ewe 


Fig. 3. Diagrammatic representation for the tree order power spectra. A straight line represents 
the ¢ propagator while the graviton propagator is wavy. 
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+ 


Fig. 4. One loop corrections to the scalar power spectrum. Straight lines represent the ¢ propa- 
gator while the graviton propagator is wavy. 


And because they persist even in the de Sitter limit of € = 0 it is obvious that the 
purely graviton interactions are not e-suppressed, 


cqP-1 


I~ Témna | 

The various diagrams which contribute to the one loop correction to A%(k) are 
depicted in Fig. 4. In each case, the leftmost point is fixed at «" = (t,x) and the 
rightmost point is fixed at 2’“ = (t,0). Interior points are integrated. For example, 
the leftmost diagram on the first line has the general form, 


fPvidcaVeo) [Py dca Vow tdcwW?. (116) 


hOhOh + h?0hOh + - ++]. (115) 


1 
7 ens + Lpate--: 


where Ves(y) and Ves(y') denote the vertex operators one can read off from the 
¢3 interaction. To recover the ordering in (57) the x line must have — polarity 
and the 2’" must be +, while the y“ and y’" vertices would be summed over all + 
variations. 


I will return to the possibility that vertex integrations lead to temporal growth, 
but for now let me assume that the two net derivatives in each vertex combine with 
the associated integral to produce a factor of c/H?. Under this assumption one can 
estimate the strength of any diagram by combining: 


e A factor of hGH?/c’« for each ¢ propagator; 

e A factor of hGH?/c° for each h;; propagator; and 

e A factor of c’e /RGH? for each vertex with either 2N — 1 or 2N ¢ fields and 
any number of hy; fields. 


For example, the estimated result for the leftmost diagram on the first line of 
Fig. 4 is, 


AGH?\*  (RGH?\°  ( ce \?  (RGH?\"? _ (RGH? hGH?e 
Oe 7 on “\anGH?) ~ \ ee age on 


(117) 


In the final expression of (117) I have extracted the tree order result (72), so one 
sees that this one loop correction is down by the factor of hGH?/c° (which was 
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So. h 


Fig. 5. One loop corrections to the tensor power spectrum. Straight lines represent the ¢ propa- 
gator while the graviton propagator is wavy. 


inevitable on dimensional grounds) times an extra factor of ¢. Neither of the two 
diagrams on the bottom line of Fig. 4 has this extra suppression, 


AGH? AGH? ce \' (GH? (RG 18) 
oe ro hGH?) Oe oe J? 
AGH? cai hGH? - ce \' _ (AGH? (RG 19) 
oe © hGH?) Oe e& } 


The one loop corrections to A? (k) are depicted in Fig. 5. The same rules suffice 
to estimate the strengths of these corrections, although one must recall that the 
tree order result (72) has no e-enhancement. For example, the leftmost diagram on 
the first line of Fig. 5 contributes, 


hGH?\*  (RGH?\*  (_ce_\” _ (hGH?) | (hGH? 90 
ce oe hGH?) 3 eo } ) 


None of the one loop corrections to A?(k) is any stronger than (120); the cen- 
tral diagram on the first line is actually suppressed by an additional factor of e. 
We therefore conclude that one loop corrections to each of the power spectra are 
generically suppressed from the tree results (72) by a factor of RGH?/c? ~ 10719. 
In these estimates it will be noted that I have not specified when the various 
factors of H(t) and e(t) are evaluated. Both quantities are thought to be nearly 
constant during much of primordial inflation — in which case, it does not matter 
much when they are evaluated. However, it is well to recall that the actual loop cor- 
rections are integrals of sometimes differentiated propagators, like expression (116). 
Weinberg noted the possibility for these integrations to grow with the co-moving 
time.?! That this can happen is associated with IR divergences of the ¢ and hij 
propagators which are evident from the small k limiting form (69) of both mode 
functions for constant ¢.!'® This leads to two sources of possible secular growth: 


e A coincident propagator — such as the four diagrams on the bottom lines of 
Figs. 4 and 5 — grows like AGH?/c> x Ht;46447118 and 

e When the analogous in-out expression would be IR divergent, a Schwinger— 
Keldysh integration such as (106) is finite but grows with time.'!® 
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The physical origin of both effects is that even the long wavelength parts of the ¢ 
and hj; effective actions are affected by the on-going process of inflationary particle 
production. 

Weinberg proved an important theorem which limits the growth of loop cor- 
rections to the primordial power spectra for the single-scalar model (27) plus an 
arbitrary number of free scalars which are minimally coupled to gravity.?? His result 
is that the largest possible secular enhancement to the depressingly small estimates 
(119) and (120) consists of powers of the number of inflationary e-foldings. His 
student Bua Chaicherdsakul extended the result to cover fermions and gauge par- 
ticles.!2° However, the situation changes radically if one allows matter couplings 
to the inflation because the resulting Coleman—Weinberg corrections to its effective 
potential can induce important changes in the expansion history. For example, if 
an m7? inflation were coupled to a massless fermion the resulting negative energy 
Coleman—Weinberg correction would cause the universe to end in a Big Rip sin- 
gularity.!?! Because the gauge (38) forces the inflation to agree with its classical 
trajectory, changes in the physical expansion history manifest in secular growth of 
¢(t,x) correlators. It should also be noted that Weinberg’s theorem is limited to 
the inflationary power spectra. Explicit computations show that loop corrections to 


122 123 


other correlators such as the vacuum polarization’~* and the fermion self-energy 


can grow like powers of the inflationary scale factor. 


4.3. Nonlinear extensions 


No one disputes Weinberg’s bound, but some cosmologists disagree that there can 
be any secular corrections. Weinberg gave two examples,?! which other authors 
confirmed.!24 However, Senatore and Zaldarriaga identified a problem with the use 
of dimensional regularization in one of these examples, and went on to argue that no 
secular enhancements are possible under any circumstances.!?° 
that models can be devised for which quantum corrections to the naive correlators 
(57), (58) grow with timelike powers of the number of e-foldings, just as Weinberg 
stated.”! Close examination of claims to the contrary!2° !°8 reveals that the authors 
are not actually disputing this, but rather arguing that the naive correlators (57), 
(58) should be replaced with other theoretical quantities which fail to show secular 
growth. That brings up the fascinating and crucial issue of what operators represent 
the measured power spectra. 

The problem with trying to overcome the loop suppression through secular 
enhancements is that the growth begins at first horizon crossing and terminates 
with the end of inflation. But observable modes experienced first horizon crossing 
at most 50 e-foldings before the end of inflation, which means the enhancement 
can be at most some small power of 50. The issue which focused people’s attention 


It seems very clear 


on modifying the naive observables was not secular growth but rather the closely 
associated problem of sensitivity to the IR cutoff. Ford and Parker showed in 1977 
that the propagator of a massless, minimally coupled scalar has an IR divergence 
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for any constant ¢(¢) cosmology in the range 0 < «€ < 3116 In view of relations 
(67), (68) this same problem afflicts both the ¢ and h,; propagators. Like all IR 
divergences, this one derives from posing an unphysical question. The problem in 
this case is arranging large correlations for super-horizon modes which no local 
observer can control. There are two fixes which have been suggested: 


e Either arrange for the initially super-horizon modes to be in some less highly 
correlated state!9; or else 
e Work on a spatially compact manifold such as T?~! whose coordinate radius is 


such that there are no initially super-horizon modes.!°° 


In practice, each fix amounts to cutting off the Fourier mode sum at some minimum 
value k = L~!. If IR divergences could be shown to afflict loop corrections to the 
power spectra, and if the cutoff DZ were large enough, then loop corrections might 
be significant. 

I recommend the review article by Seery on IR loop corrections to inflation- 


131 Important work was done by a number of authors.!°?-!45 In 


ary correlators. 
2010 Giddings and Sloth were able to give a convincing argument that graviton 
loop corrections to A?,(k) are indeed sensitive to the IR cutoff L,14614” and hence 
able to make significant corrections. This disturbed people who think about gauge 
invariance in gravity because the actual IR divergence — as opposed to the closely 
associated secular growth factor — is a constant in space and time, and a con- 
stant field configuration h,;(t,x) ought to be gauge equivalent to zero. Even before 
the work of Giddings and Sloth, the possibility of such corrections had prompted 
Urakawa and Tanaka to argue for modifying the original definition (57) so that 
the spatial argument of the first field ¢(t,x) is replaced by the metric-dependent 
geodesic X[g|(x) which is a constant invariant length ||x|| from the other point 0 
in the direction «,!48:149 
3 


Az (kt) + 5 f dre ™™*(O1C(¢, Xlo] C(t, 0)10). (121) 


After the paper by Giddings and Sloth, it was quickly established that these sorts 
of partially invariant observables are free of the IR. divergence.'°° 15° In subsequent 
work, Giddings and Sloth have sought to identify invariant observables which still 
show the enhancement.!°7!58 Tanaka and Urakawa have also continued their work 
on the problem,!°? 162 

The discussion of IR effects attracted me because I had for years been working 
on these in de Sitter background. I was also fascinated by the struggle to identify 
physical observables in quantum gravity because my long-neglected doctoral thesis 
dealt with that very subject.'® In fact had I considered corrections involving pre- 
cisely the same sort of geodesics as in (121)! One thing I discovered is that they 
introduce new ultraviolet divergences associated with integrating graviton fields 
over the one-dimensional background geodesic.'® These new divergences change 
the power spectrum into the expectation value of a nonlocal composite operator 
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which no one currently understands how to renormalize.?? Shun-Pei Miao and I 
also demonstrated that (121) disturbs the careful pattern of e-suppression which 
we saw in Sec. 4.2; one loop corrections to (121) go like the tree order result (72) 
times hGH? /c°«.?3 For the very same reason, non-Gaussianity would also be unsup- 
pressed.?? So changing what theoretical quantity, we identify with the scalar power 
spectrum from (57) to (121) in order to avoid sensitivity on the IR cutoff would 
come at the high price of introducing uncontrollable ultraviolet divergences and 
observable non-Gaussianity. It seems a bad bargain, and I mean no disrespect to 
colleagues who are struggling to puzzle out the truth, as am I, when I say we must 
do better. 

It seems clear to me that we need new ideas. One radical and thought-provoking 
proposal is the suggestion by Miao and Park to abandon correlators altogether and 
instead quantum-correct the mode function relations (65), (66).24 Among other 
things, this would avoid the new ultraviolet divergence which Fréb et al. have 
found in one loop corrections to the tensor power spectrum because the two times 
coincide.1®3 

It also seems to me that too few physicists appreciate the wondrous opportunity 
which has befallen us to shape a new discipline by defining its observables. The 
debate on this vital subject is sometimes confused, and too often degenerates into 
shouting matches. In an effort to clarify matters Shun-Pei Miao and I laid out 10 
principles which are worth repeating here?*: 


e 


IR. divergence differs from IR growth; 

The leading IR logs might be gauge independent; 
Not all gauge dependent quantities are unphysical; 
Not all gauge invariant quantities are physical; 
Nonlocal “observables” can null real effects; 
Renormalization is crucial and unresolved; 
Extensions involving ¢ must be e-suppressed; 

It is important to acknowledge approximations; 
Sub-horizon modes cannot have large IR logs; and 
Spatially constant quantities are observable. 
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4.4. The promise of 21 cm radiation 


Particle physicists are familiar with the saying, “yesterday’s discovery is tomorrow’s 
background.” Cosmologists are today witnessing the final stages of this process in 
the context of observations of the cosmic microwave background, as interlocking 
developments in technology and understanding of astrophysical processes have per- 
mitted fundamental theory to be probed more and more deeply. A brief survey of 
the history is instructive: 


e 1964 — discovery of the monopole, for which Penzias and Wilson received the 
1978 Nobel Prize; 
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e 1970s — discovery of the dipole, which gives the Earth’s motion relative to the 
CMB; 

e 1992 — discovery of lowest higher multipoles in the temperature-temperature 
correlator by COBE, for which Mather and Smoot received the 2006 Nobel Prize; 

e 1999 — detection of the first Doppler peak by BOOMERanG and MAXIMA, 
supporting inflation and not cosmic strings as the primary source of structure 
formation; 

e early 2000s — detection of E-mode polarization by DASI and CBI, and demon- 
stration by WMAP of the T-F anti-correlation predicted by inflation; 

e 2003-2010 — full-sky maps of temperature and E-mode correlators by WMAP, 
and their use for precision determinations of cosmological parameters; 

e 2013 — full-sky map of Planck resolves seven Doppler peaks and give tighter 
bounds on ACDM parameters; 

e 2013 — First detection of B-mode polarization from gravitational lensing by the 
South Pole Telescope; 

e 2014 — Detection of primordial B-mode polarization claimed by BICEP2, con- 
firming another key prediction of primordial inflation, fixing the inflationary 
energy scale to be ~ 2 x 10'6 GeV, and incidentally establishing the existence 
and quantization of gravitons; and 

e 2014 — Resolution of six acoustic peaks of E-mode polarization by the Atacama 
Cosmology Telescope Polarimeter, which provides an independent determination 
of ACDM parameters. 


The first few steps are even now being taken in what could be an equally fruitful 
evolution, whose full realization will consume decades as it yields a steady series of 
discoveries. I refer to the project of surveying large volumes of the universe using 
the 21 cm line.!®* The discovery potential is obvious from the comparison between 
an X-ray and a CT scan: all that has been learned from the cosmic microwave back- 
ground derives from the surface of last scattering, whereas 21cm radiation allows 
us to make a tomograph of the universe. 
Current and planned projects probe two regimes of cosmic redshift: 


e 0 Sz <4 — in which the radiation from unresolved galaxies is observed to probe 
baryon acoustic oscillations.'6° 1” 
e 6 Sz < 10 — in which intergalactic Hydrogen is observed to probe the epoch of 
reionization.!7! 1” 
The first of these provides important information for understanding the mysterious 
physics which is causing the current universe to accelerate, the discovery of which 
earned Perlmutter, Schmidt and Riess the 2011 Nobel Prize. The second is crucial 
to understanding the first generation of stars, and will eventually be an important 
foreground in future observations. 
As technology and engineering improve, and as astrophysical effects are better 
understood, it is possible to foresee a time (decades from now) when redshifts as 
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high as z ~ 50 are observed to measure the matter power spectrum with staggering 
accuracy. There is enough potentially recoverable data in the 21 cm radiation to 
resolve one loop corrections.17® Current measurements of A}, (k) do not test fun- 
damental theory because we lack a compelling mechanism for driving primordial 
inflation, but that is bound to change over the decades required for the full maturity 
of 21 cm cosmology. And when we do understand the driving mechanism, it will 
be possible to untangle the one loop correction from the tree order effect, which 
will test quantum gravity. This could be for quantum gravity what the mea- 
surement of g — 2 was for quantum electrodynamics. The data is there, and 
people will be working for decades to harvest it. 


5. Other Quantum Gravitational Effects 


As I have explained, the driving force for quantum gravitational effects during 
inflation is the production of nearly massless, minimally coupled scalars (if there 
are any) and gravitons. The presence of these particles is quantified by the scalar 
and tensor power spectra. Because Einstein + anything is an interacting quantum 
field theory, the newly created particles must interact, at some level, both with 
themselves and with other particles. This section describes how to study those 
interactions. I first list the various linearized effective field equations, then I describe 
the propagators and how to represent the tensor structure of the associated 1PI 
two-point functions. The section closes with a review of results and open problems. 
However, the issue of back-reaction is so convoulted and contentious that it merits 
its own subsection. 


5.1. Linearized effective field equations 


We want to study how the propagation of a single particle is affected by the vast 
sea of IR gravitons and scalars produced by inflation. That can be done by com- 
puting the 1PI two-point function of the particle in question and then using it 
to quantum-correct the linearized effective field equation. Recall from (10) that 
quantum gravitational loop corrections from inflationary particle production are 
suppressed by hGH?/c? ~ 10~'°. Because this number is so small it is seldom 
necessary include nonlinear effects or to go beyond one loop order. The usual unit 
conventions of relativistic quantum field theory apply in which time is measured so 
that c = 1, and mass is measured so that h = 1. The loop counting parameter of 
quantum gravity is K? = 167G(xh/c?) + 1.3 x 10-8 m?. 

Because €; © 0.013 is so small, most work is done on de Sitter background, for 
which e(t) = 0 and A(t) is a constant. Computations are done on a portion of 
the full de Sitter manifold which is termed “the cosmological patch” in the recent 
literature, and sometimes “open conformal coordinates” in the older literature, 


1 
ds* = a?(n)[—dn* + dx- dx], a(n) = “oa a (122) 
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The D — 1 spatial coordinates exist in the same range —o0 < a < +00 as Minkowski 
space, but the conformal time 77 is limited to the range —co < 7 < 0. The de Sitter 
metric is guy = anv, Where nv is the Minkowski metric. In contrast to Sec. 4, 
metric fluctuations are characterized by the conformally rescaled and canonically 
normalized graviton field hyv, 


a2 ta) = a? [nav + Khpr(2)]. (123) 


Graviton indices are raised and lowered with the Minkowski metric, h*, = n"?Npv, 
REY = Pn’? Rog, h = NY’ Ryv. Fermion fields are also conformally rescaled, 


pel(2) = at x v;(2). (124) 


The various 1PI two-point functions are evaluated using dimensional regular- 
ization, then fully renormalized with the appropriate counterterms in the sense of 
Bogoliubov and Parasiuk,!’” Hepp!”® and Zimmermann!”?-!8° (BPHZ). After this 
the unregulated limit of D = 4 is taken. As explained in Sec. 4.1, quantum correc- 
tions to the in-out effective field equations at spacetime point 2" are dominated 
by contributions from points z’" in the infinite future when the three-volume has 
been expanded to infinity. Quantum corrections to the in-out matrix elements of 
field operators are also generally complex, even for real fields. These results are cor- 
rect for in-out scattering theory, but they have no physical relevance for cosmology 
where the appropriate question is, what happens to the expectation value of the field 
operator in the presence of a prepared state which is released at some finite time. 
One solves that sort of problem using the Schwinger—Keldysh formalism.1°° 1°? In 
this technique each of the 1PI N-point functions of the in-out formalism gives rise 
to 2% Schwinger—Keldysh N-point functions. It is the sum of the ++ and +— 1PI 
two-point functions which appears in the linearized Schwinger—Keldysh effective 
field equation. This combination is both real and causal. 

The 1PI two-point function for a scalar is known as its “self-mass-squared,” 
—iM?(x; 2’). The quantum-corrected, linearized field equation for a minimally cou- 
pled scalar with mass ™ is, 


a?(—02 — 2Hadh + V)p(z) — m?a*y(z) — facie a \y(@’)=0.. (125) 


The (conformally rescaled) fermion’s 1PI two-point function is called its “self- 


energy,” —i[;Xj|(a;2"). If yj, stands for the usual gamma matrices then the 
quantum-corrected, linearized field equation for a fermion with mass ™ is, 
ij Outs (x) — marp;(x) — fee lalee We) = 0 (126) 


The 1PI two-point function for a photon has the evocative name “vacuum polar- 
ization,” +7[*II’](a; 2’). If F’* = n’?n¥?(0,Aq — O,Ap) is the usual field strength 
tensor then the quantum-corrected Maxwell equation can be written, 


0, F’* (a2) + / da HTT’ \(x; 2’) A(x’) = J*“(2), (127) 


TI-386 R. P. Woodard 


where J“(x) is the current density. Finally, the 1PI two-point function for a (con- 
formally rescaled and canonically normalized) graviton is termed the “graviton 
self-energy,” —i["”0?7|(x; 2’). It is used to quantum-correct the linearized Einstein 
equation as, 


2 
Baa? £4” °F? shoo (z)] — / dh! [MY P?|(a; 2! hyo (a!) = —~ nn’ To (2), 
(128) 
where J), is the linearized stress tensor and I define, 
Vpod 1 a o)v Vv o 1 Vv a oO 
civeras — 51 Bip HP n ) — nf’? ] + st nl 7?) 
bee tl BY — 92092) te y)(B), (129) 


P 


The scalar equation (125) and its spinor counterpart (126) of course describe the 
propagation of scalars and spinors, respectively. The vector and tensor equations 
(127), (128) can be similarly used to study the propagation of dynamical photons 
and gravitons, but they also describe modifications of the electrodynamic and gray- 
itational forces. Dynamical quanta show no modification in flat space quantum field 
theory (enforcing that is what typically fixes the field strength renormalization), so 
it is at least possible that nothing happens as well during inflation. However, the 
force laws are guaranteed to show an effect during inflation because they do so in 
flat space background.!®! 183 


5.2. Propagators and tensor 1PI functions 


The symmetries of the general cosmological geometry (1) are homogeneity and 
isotropy. However, the de Sitter limit of e(¢) = 0 results in the appearance of two 
additional symmetries. Although it obvious to cosmologists that these extra sym- 
metries can be at best approximate for inflationary cosmology, they have exerted 
a powerful influence on mathematical physicists owing to the expectation that the 
full de Sitter group should play the same role in organizing and simplifying quan- 
tum field theory on de Sitter that Poincaré invariance has played for flat space. 
That expectation has remained unfulfilled owing to the time dependence intrinsic 
to inflationary particle production. 

In our D-dimensional conformal coordinate system (122), the $D(D +1) de 
Sitter transformations can be decomposed as follows: 


e (D —1) spatial translations, 
qg=n, gf’ =a +e. (130) 
e 4(D—1)(D — 2) spatial rotations, 


f=n, 2° = Re z!, (131) 
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e One dilatation, 
n=kn, cc =k’. (132) 
e (D—1) spatial special conformal transformations, 


_ n tes x —O'eee 
~ 1-20-x+ |lOll@z- 2’ ~ 1-20-x4+ |/O|l22-a° 


ta 


4 


(133) 


Homogeneity is (130) and isotropy is (131). The two additional symmetries which 
appear in the de Sitter limit of e(¢) = 0 are (132), (133). 

Although IR divergences induce de Sitter breaking, they do so in a limited 
way that leaves the largest part of the result de Sitter invariant. For dimensional 
regularization computations the best way to express this de Sitter invariant part in 
terms of the length function y(z; z), 


y(2; 2") = aa! H?|||x — x'||? — (ln — | — ie)*}. (134) 


Except for the factor of i¢ (whose purpose is to enforce Feynman boundary con- 
ditions) the function y(z;) is related to the invariant length ¢(x; 2’) from 2" to 


al, 


y(x; 2’) = 4sin? (5Hee2')). (135) 


The four Schwinger—Keldysh polarity variations (102)—(105) never affect the de 
Sitter breaking terms. They can all be obtained by making simple changes of the 
ie term in (134), 


= aa’ H?|||x — x’ ||? — (jy —7"| — te)", 136 


137 


= aa! H?|||x — x! ||? — (9 — 9! — te)’, 


) ( (136) 
) ( (137) 
y+—(a; 2") = aa! H?|||x — x'||? — (n — 7 +e)", (138) 
) ( (139) 


= aa! H?|\\x — x'||? — (In—n!| + ie)?). 139 


The best way of expressing higher spin propagators on de Sitter, as on flat space, 
is by acting differential operators on scalar propagators. I work with a general scalar 
propagator iA,(x; 2’) which obeys the equation, 
16? (x — 2’) 

va 
Here and henceforth, the index ba is bs = 2>+ and O stands for the covariant 
scalar d’Alembertian, 


(0 + (b? — 63.) A iAg(2; 2’) = (140) 


o= Fs 0uV=a0"0.) = =3(-83 _(D—2)Ha+V?). (141) 


For the case of b < b4, the propagator has a positive mass-squared m? = (b%, — b?) H? 
and its propagator is a de Sitter invariant hypergeometric function of y(x;.’). 
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Its expansion for b = rv is, 


iAP Gye’) = for r (5 1) (=) 3 


T(ba tu +n) (ba —v +n) ey (142) 


D 4 
r(F +n) n! 


When 0b > ba the naive mode sum is IR divergent for the same reason as the 
problem discovered by Ford and Parker!!® which I described in Sec. 4.3. One some- 
times encounters contrary statements in the mathematical physics literature,!*4 
but close examination reveals that the authors admit they are constructing a for- 
mal solution to the propagator equation which is not a true propagator by the 
illegitimate technique of adding negative norm states to the theory.!®° The proper 


technique!?9!®° of cutting off the mode sum amounts to adding to (142) a de Sitter 
breaking, IR correction,186187 
HP? T)T2 
AR (a;a"') WIPO) oy — ba) 


nl*s*| 


N P] 
Ce dames (: = N-—n-—2m 
x C nm ~~ 2 ’ 
Py pore aa Pe Nnm(y ~ 2) 
(143) 
where the coefficients Cynm are, 
1\% 
(-7) T(iba+N+n-v) 
Cnnm = x 
min!(N —n — 2m)! T(ba +N -v) 
x T(ba) 7 Ti1-v) . T1-v) (144) 
T(bat+N-2m) T(l-v+n+2m) Td-v+m) 
The full propagator is therefore, 
imine |= lim [iAP (ar; 2") Ae a). (145) 


It is often useful to discuss integrated propagators which obey, 


[DO + (b? — b4,)H7]iA,.(a; z) = 1A, (2; z). (146) 
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The solution is easily seen to be,!87188 
1 
iAge(x; z) = a pe Acts z) — tAe(a; z)] = tAce(z; 2). (147) 
I also employ a doubly integrated propagator which obeys, 
(Do + (8? — 63.) B}iAbea(2; z) = iAcg(a; z). (148) 


The solution can be written in a form which is manifestly symmetric under any 
interchange of the three indices a, 6 and c, 
tApa(x; z) — tAge(z; Zz) 
(2 — d2)H? 
= (d? — c*)\iAy (a; z) + (6? — d?)iA. (2; z) + (Cc? — B?)iAa(a; 2) (150) 
(b? — c?)(c? — d?)(d? — b?)H4 
The propagator for a (conformally rescaled) fermion of mass m was constructed 


by Candelas and Raine! in terms of the scalar propagator (142) with vy = —$+i2, 


iAdea(; z) = 


(149) 


i[:S;](aj 2’) = [i7"O,, + am](aa’) >! 
‘. 1¢ =) iA (a a") + (4*) iA ae"), (151) 


Except for the conformal rescaling, expression (151) is de Sitter invariant as long as 
the mass is real. Tachyonic fermions with m? < —DH? break de Sitter invariance 
the same way that tachyonic scalars do. 

Vector and tensor fields raise the issue of gauge fixing. Normally we would 
accomplish this by adding to the action gauge fixing terms which respect the coor- 
dinate isometries of the background, however, there is an obstacle to doing this on 
any background such as de Sitter which possesses a linearization instability.!8°> This 
leaves two alternatives: 


e One either add a de Sitter breaking gauge fixing term which cannot be extended 
to the full de Sitter manifold; or 
e One can impose some gauge condition on the field operators. 


After some discussion of the problem, I will give results for both alternatives. 

The obstacle to adding invariant gauge fixing terms came as a surprise because 
one can actually derive them in flat space quantum field theory by starting from an 
exact gauge and making functional changes of variables.!°° In 2009 Miao, Tsamis 
and I identified precisely where this procedure breaks down when a linearization 
instability is present!8* but one can see the physics problem quite simply by con- 
sidering flat space electrodynamics on the manifold T? x R. Because the spatial 
sections are compact, both sides of the spatially averaged, 4: = 0 Maxwell equation 
must vanish separately, 


OF = I> | @PediF (t,x) = | Px J(t,x) =0. ee) 
T3 Ts 
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This zero charge constraint follows from the invariant field equations so it must be 
true in any valid gauge. However, adding a Poincaré invariant gauge fixing term 
results in a very different theory. The field equations of Feynman gauge are, 


[—a? + V7 A" Gx) = 7" (ex). (153) 


These equations can be solved for any total charge so there cannot be any question 
that the theory has been changed. 

The use of covariant gauge fixing terms is so ingrained that some people’s first 
reaction to the obstacle on de Sitter is, “let’s just go ahead and do it anyway!” 
As it happens, Emre Kahya and I had stumbled upon what goes wrong if that is 
done in 2005. We used two different gauges to compute and fully renormalize the 
one loop self-mass-squared —iM?(x; 2’) for a charged, massless, minimally coupled 
scalar on de Sitter.19!:!9? There was no problem with a de Sitter-breaking gauge 
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— nor is there any problem’’* when Lorentz gauge is enforced as a 


195 


fixing term 
strong operator condition’’? — but we found on-shell singularities when using the 
de Sitter-Feynman gauge propagator.!°° The origin of these singularities seems to 
be that integrating the self-mass-squared against the scalar wave function measures 
the AgJ°® interaction of the particle with its own field, and one can see from Eq. (153) 
that the solution for Ao(t,x) must grow like Qt?/2 in Feynman gauge. 


My favorite de Sitter-breaking gauge fixing term for electromagnetism is,!°? 


Lor = = 50? Ay — (D —4)HaAo)?. (154) 


Because space and time components are treated differently it is useful to have an 
expression for the purely spatial part of the Minkowski metric, 


Tio = Taw + ee (155) 


In this gauge, the photon propagator takes the form of a sum of constant tensor 
factors times scalar propagators, 


i[uAv] (2; 2") = 7, aa/tAp(a; 2’) — 69.5) aa'iAc(a; 2’). (156) 


The B-type and C-type propagators are special cases of (142) with vy = (D — 3)/2 
and v = (D —5)/2, respectively, 


aioe ey EG)" 


gyre a Tipe) fay], (AB) 
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(158) 


Note that the infinite sums in (157) and (158) vanish for D = 4, which means they 
only need to be included when multiplied by a divergence, and even then only the 
lowest terms of the sums are required. In fact the B-type and C-type propagators 
agree in D = 4, and the photon propagator in this gauge is the same for D = 4 as it 
is in flat space! Despite the noncovariant gauge, this propagator shows no physical 
breaking of de Sitter invariance. 


My favorite de Sitter-breaking gauge fixing term for gravity is,!9%:19” 
1 
Lor = —500 nt FF, Fi, 
eS 1 
=f” (aoe — 5H pan + (D—2)Harhapd. (159) 


Note that it breaks (133) but preserves (130)—(132), just like its electromagnetic 
cousin (154). In this gauge the graviton propagator takes the form of a sum of three 
constant index factors times a scalar propagator, 


tlw Apo] (a; 2") = = [ae Trlr as ). (160) 
I=A,B,C 


The constant index factors are, 


eo Bey 
oe 7 2 Tu(plo\y 2 D —3 melee (161) 
By 0 = 0 
[pw Tho] _ 457 Fatooes (162) 
2 


— l= D — 3)5°8) +7, y][(D — 3)6062 + 7,5]. (163 
[) eal (D—a)(D—3) |! ) a a Daal ) p | ( ) 
The de Sitter invariant B-type and C-type propagators appear as well in the photon 
propagator (156). The A-type propagator corresponds to (145) with b= (D —1)/2, 


and it is better known as the propagator of a massless, minimally coupled 
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scalar ,198:199 


iA (a; 2") 


z+) a =>, 2 
n- +2) rin +2) , ot (n+3) | 


where the constant A, is, 


a= | v(t =) ro (FS) + u(D +uay} (165) 


The de Sitter breaking factor of In(aa’) on the first line of (164) is intensely 
disturbing to mathematical physicists who insist that it must be an artefact of the 
de Sitter breaking gauge (159).2°° However, there are several ways of seeing that 
gravitons show real de Sitter breaking.?°! The simplest and most physical is to note 


that linearized gravitons in transverse-traceless gauge obey the same equation as the 
massless, minimally coupled scalar,?°3” whose propagator is admitted to break de 
Sitter invariance.!®° In fact, the breaking of de Sitter invariance is a consequence 
of the time dependence of inflationary particle production and is embedded in 
the primordial power spectra.!®° One should also note that the gauge fixing term 
(159) preserves dilatation invariance (132) while the propagator (164) breaks this 
symmetry. Finally, one can show that the propagator is not invariant even when 
naive de Sitter transformations are augmented to include the compensating gauge 
transformation needed to restore (159).70? 

It is also possible to enforce gauge conditions — even de Sitter invariant ones — 
as strong operator equations. When this is done, the resulting propagators can no 
longer be expressed using constant tensor factors. The most economical representa- 
tion is to write them as differential projection operators acting on scalar structure 
functions. The differential operators are constructed using the covariant derivative 
operator D, based on the de Sitter connection, 


TP = aH (5°59, + 59,69, — 1°? my). (166) 
We raise and lower indices using the de Sitter metric, DY = g'”D,. 


The Lorentz gauge condition D“A, = 0 implies that the photon propagator 
obeys, 


DM Aa aaa") =0= Di, Ay |e. (167) 
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If the photon has mass My (which can happen due to spontaneous symmetry 
breaking) then its propagator obeys the equation, 


[DO — (D -1)H? — MP ]iL Ap] (2; 2’) 
- igund” (2 — 2’) 
vg 
The term involving tA, on the right-hand side of (168) is required for consistency 


with the gauge condition of Eq. (167) of Ref. 195 and was missing from an earlier 
analysis.'9° We can find the transverse projector P,'; by writing the electromagnetic 


+ DD jtAalas 2). (168) 


field strength as Fug = P./5A,,. The photon propagator takes the form,?” 
. a KX DD", D dD! y 
i pAdl (a) =P ee 5H? * [sitesa') oie (169) 


The vector structure function can be expressed using a singly integrated propagator 
(147) Pe 


2H? 2H? D-3\” Me 
S|; =4 Me iApe Me [iAp —iAg], wewe=/(2=2) ——¥, (170) 


It is de Sitter invariant so long as MZ > —(D — 2)H?, which includes the usual 
massless photon. 

Recall from expression (123) that we define the graviton field h,,, by conformal 
rescaling. Although this is simplest for computations, enforcing de Sitter covariant 
gauge conditions is better done with the field yp, = OF ogy whose indices are raised 
and lowered with the de Sitter metric. The most general de Sitter covariant gauge 
condition can be parametrized with a real number (3 ¥ 2, 


DX aw = E Dux, = 0. (171) 


In any gauge of this form the x,,, propagator is the sum of a spin zero part and a 
spin two part,203:204 


iLapApo] (23 2") = ifapApe] (a; 2’) + ifaadja] (x; 2’). (172) 
The spin zero part is diagonal on the primed and unprimed index groups, 
iy A%g](032") = Paw x P! px lS0 (32). (173) 


The spin zero projector Py, is, 


= p= 
Puv = DyDv + Gu (or5)o 2(a-s) ). (174) 
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And the spin zero structure function can be expressed in terms of a doubly inte- 
grated propagator (150) ,203:204 

2(Df — 2)? 
(2 — 8)?(D — 2)(D —1) 
where by = (D+1)/2 and bi, = (2+)? 4 2( 253). Although By corresponds to a 
tachyonic mass of m? = —DH?, it turns out that this source of de Sitter breaking 
drops out when acted upon by the projectors.2°° The index by corresponds to a 


scalar of mass m? = —2( 355) H 2 and it will give rise to physical de Sitter breaking 


for B < 2. 
Constructing the spin two part of the propagator is roughly analogous to what 


So(x; 2’) = iA wwn(a;2"'), (175) 


was done for the photon. I first define the projector by expanding the Wey] tensor, 
Crs = PhP eg + O(k?x7). This operator can be used to build a manifestly 
transverse and traceless quantity, 


K D,DD',,D' 
ie ase ) ——4 ee x Pos ADh ee 
4H 
DgD'yy DsD' 
x So(030') (176) 


The (gauge independent) spin two structure function involves doubly integrated 
propagators (150) ,20% 

So(x;z) = op apelidaaales z) — 2iAgap +iAypea(a;z)]. (177) 
Although the B-type propagator is de Sitter invariant, we saw from expression 
(164) that the A-type propagator is not. An explicit computation shows that this 
de Sitter breaking does not drop out when all the various derivatives in (176) are 
acted.20 

I have already noted that mathematical physicists are very loath to accept de 
Sitter breaking in the graviton propagator, so it is perhaps not surprising that a 
final attempt was made to avoid it.2°° However, the net result was simply to clarify 
the illegitimate analytic continuations which must be employed to derive formal 
de Sitter invariant solutions that are not true propagators.!®° There are still some 
who believe that the de Sitter breaking evident in the all correct solutions (160)— 
(163) and (172)-(177) will drop out when gauge invariant operators are studied, 
as it does from the linearized Weyl-Wey] correlator.?°7?°8 I doubt this, and I will 
point out in Sec. 5.4 that it amounts to re-fighting the same controversy over fields 
versus potentials which was decided for electromagnetism by the Aharomov-Bohm 
effect. However, the important thing to note here is that everyone now agrees on 
the propagators which must be used to make such computations. 

Just as for their propagators, so it is best to represent the 1PI two-point func- 
tions of vector and tensor fields in terms of differential operators acting on structure 
functions. When the particular loop under study shows no physical de Sitter break- 
ing one could employ a de Sitter invariant representation. However, long experience 
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shows that it is usually superior to a employ simple, de Sitter breaking representa- 
tion, even when there is no physical breaking of de Sitter invariance.2°° 2"! For the 
vacuum polarization that form is, 


iT’) = (nf? — nt? n”? 0,0, F(a; x’) 
+ (NY TPS — TCHP? Op0, G(x; 2"). (178) 


For the graviton self-energy based on the conformally rescaled field h,,, defined in 
expression (123) it is, 


— 1h’? | (a; 2’) = FM x FP? [Fy (a: 2')] + G" x G'™ [Go(z;2’)] 
+ FRYPO| FS (x; 2')| + GUY?" [Go(a; 2"). 179) 
The scalar differential operators are, 


FEY = MOY + 2fraHO#OY) + foa?H?64,6% — ni” (0? + fsaHOo + fac? H?], 


ge = OO + In aHs ta” + goa” H?5",6% — WHY (V? + g3aH do + gaa”? H? |, 
181) 


where ordinary derivative indices are raised using the Minkowski metric and I 
remind the reader that a bar denotes the suppression of temporal indices, 7n’” = 
Nu» +659). The various constants in (180), (181) are, 


fi=fs=fa=(D-1), fo=(D-2)(D-1), (182) 
9 =93=94=(D-2), g2=(D—2)(D-1). (183) 


The spin-2 operators are constructed using a transverse-traceless projector obtained 
by expanding the Weyl tensor, Cag75 = a7?C,4.5" X Khyy + O(K7h?), 


aByd 
pe = Cupyé x nr ad x Arn nn? (184) 
ge = Copys x Cenod x ie ae Tl (185) 


5.3. Results and open problems 


As explained in Sec. 2, inflationary particle production is significant for gravitons 
and for massless, minimally coupled scalars. Any effect due to gravitons is, by 
definition, quantum gravitational, however, scalar effects may or may not be. An 
important example of scalar quantum gravitational effects is correlators of the field 
¢(t,x) which represents the gravitational response to a scalar inflation. On the 
other hand, it is possible to imagine a massless, minimally coupled scalar which 
is a spectator to de Sitter inflation, with gravity considered to be a nondynamical 
background. The effects from such scalars are not strictly quantum gravitational, 
but I will mention them as well because their physics is so similar, and because 
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they are so much simpler to study. For four such spectator scalar models there are 
complete, dimensionally regulated and fully renormalized results: 


e For areal scalar with a quartic self-interaction both the expectation value of the 
stress tensor!®®!99 and the self-mass-squared?!?:?!8 have been computed at one 
and two loop orders. These calculations show that inflationary particle production 
pushes the scalar up its potential, which increases the vacuum energy and leads 
to a violation of the weak energy condition on cosmological scales without any 
instability. 

e For a massless fermion which is Yukawa-coupled to a real scalar, one loop 
computations have been made of the fermion self-energy,!??:?!4 the scalar self- 
mass-squared,?!> and the effective potential.?!© There is also a two loop com- 
putation of the coincident vertex function.?!® These calculations show that the 
inflationary production of scalars affects super-horizon fermions like a mass, while 
the scalar mass remains small. The fermion mass decreases the vacuum energy 
without bound in such a way that the universe eventually undergoes a Big Rip 
singularity.?!6 

e For scalar quantum electrodynamics, one loop computations have been made of 
the vacuum polarization from massless scalars!?2:2!” — and also slightly massive 
scalars?!8 — of the scalar self-mass-squared,!9!19?-1% and of the effective poten- 
tial.219:220 Much more difficult two loop computations have been made for the 
square of the scalar field strength and for its kinetic energy,!** for two coincident 
field strength tensors,?2! and for the expectation value of the stress tensor.??! 
These calculations show that the inflationary production of charged scalars causes 
super-horizon photons behave as if they were massive, while the scalar remains 
light and the vacuum energy decreases slightly.??°22? Comoving observers, at 
an exponentially increasing distance from the sources, perceive screening of elec- 
tric charges and magnetic dipoles, while observers at a fixed invariant distance 
perceive the same sources to be enhanced.??% 

e The nonlinear sigma model has been exploited to better understand the deriva- 
tive interactions of quantum gravity,??4 and explicit two loop results have been 
obtained for the expectation value of the stress tensor.??°-??° These computations 
show that while the leading secular corrections to the stress tensor cancel, there 
are sub-leading corrections. 


Spectator scalar effects are simpler than those of gravitons, and generally stronger 
because they can avoid derivative interactions. Although scalar effects avoid the 
gauge issue, they are less universal because they depend upon the existence of 
light, minimally coupled scalars at inflationary scales. In five models with gravitons 
there are complete, dimensionally regulated and fully renormalized results: 


e For pure quantum gravity, the graviton one-point function has been computed 
at one loop order.??” This shows that the effect of inflationary gravitons at one 
loop order is a slight increase in the cosmological constant. 
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e For quantum gravity plus a massless fermion, the fermion self-energy has been 
computed at one loop order.!?!:??8 This shows that spin-spin interactions with 
inflationary gravitons drive the fermion field strength up by an amount that 
increases without bound.?2%299 

e For quantum gravity plus a slightly massive fermion, the fermion self-energy has 
been computed to lowest order in the mass.?3! However, the dynamical conse- 
quences of this have not yet been explored. 

e For quantum gravity plus a massless, minimally coupled scalar there are one loop 
computations of the scalar self-mass-squared?*? and the graviton self-energy.??% 
The effective field equations reveal that the scalar kinetic energy redshifts too 


rapidly for it to experience a significant effect from inflationary gravitons,?3+ 


or 
for a graviton to experience significant effects from inflationary scalars.?!29> The 
effects of inflationary scalars on the force of gravity, are still under study. 

e For quantum gravity plus electromagnetism, there is a one loop computation 
of the vacuum polarization.726 Even though gravitons are uncharged, they do 
carry momentum which can be added to virtual photons to make comoving 
observers perceive an enhanced force, while static observers experience no secular 
change.?°” The effect for magnetic dipoles is opposite.?2” The effect on dynamical 
photons is still being studied. 


It is worth noting that all these computations were made with the simple graviton 
propagator (160) in the noncovariant gauge (159). 

Dolgov did a very early computation of the small cosmological contribution to 
the vacuum polarization from fermionic quantum electrodynamics.2°° Another early 
result is Ford’s approximate evaluation of the one loop graviton one-point function 
in pure quantum gravity.2°° A more recent computation was made using adiabatic 
regularization.74° Momentum cutoff computations have also been performed of the 
one loop graviton self-energy?! and the two loop graviton one-point function.?* 
For gravity plus various sorts of scalars, the one loop scalar contribution to the 
noncoincident graviton self-energy has been computed.?** More recently, Kitamoto 
and Kitazawa have been exploring IR graviton corrections to various couplings in 
the off-shell effective field equations.24* 248 

Given a Lagrangian, it is quite simple to work out the interaction vertices in 
any background. The obstacle to making computations is finding the propagators. 
In Sec. 5.2 I have given all of them, in all known gauges, so anyone wishing to 
make computations should be able to do so. In spite of all the work that has been 
done, many interesting calculations remain which can be done within the existing 
formalism: 


e Use the previously derived one loop graviton contribution to the vacuum polar- 
ization to work out the effects on dynamical photons.?%° 

e Use the previously derived one loop scalar contribution to the graviton self- 
energy2!!)33 to work out corrections to the force of gravity. 
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e Perform a fully dimensionally regulated and renormalized computation of the one 
loop graviton contribution to the graviton self-energy in the noncovariant gauge 
(159) and use it to work out, what happens to dynamical gravitons?*® and to the 
force of gravity. 

e Re-compute the one loop graviton contributions to the scalar, fermion, vector and 
graviton 1PI two-point functions in a general de Sitter invariant gauge (171). This 
is especially important to check the conjecture? that the leading IR logarithms 
might be gauge independent. 

e Compute the two loop expectation value of C79 (z)Cog45(a) to see if the secular 
corrections which are certainly present in individual diagrams cancel out all the 
diagrams are added to produce a scalar. 


There are also two improvements of the basic formalism which need to be 
developed: 


e Work out corrections to the various initial states; and 
e Develop reasonable and consistent approximations for including the effects of 
e(t) £ 0 in realistic models of inflation. 


The state corrections would allow us to solve the effective field equations at all 
times, rather than only at asymptotically late-times. This is important to check for 
corrections to field strengths which develop for a while and then approach a constant 
at late-times when the particle under study has redshifted too much to interact 
further with the sea of IR quanta produced by inflation. It has been suggested that 
this could produce an observable k-dependent tilt to the scalar power spectrum.?34 
The goal of the second improvement would be to finally perform complete and fully 
renormalized computations of the one loop corrections to the power spectra. Note 
from Sec. 5.2 that the technology for representing vector and tensor propagators 
and 1PI functions can be applied to any geometry, so the real problem is just 
generalizing the scalar propagator to arbitrary €(t).?°° 

Finally, there are a number of less focused but very important issues which 
require study: 


e Develop gauge independent and physically reasonable ways of quantifying 
changes in particle dynamics due to inflationary particle production; and 

e Work out observable consequences of the interactions other particles have with 
inflationary gravitons and scalars. 


The first point is closely related to the important issue I discussed in Sec. 4.3 of 
how we correspond the observed power spectra to quantities in fundamental theory. 
Of course the second issue is crucial if the calculations described in this section are 
ever to be tested. One obvious point of phenomenological contact is the origin of 
cosmic magnetic fields.7°!75? It has been suggested that the vacuum polarization 
induced during inflation by a light, charged scalar — possibly part of the Higgs 
doublet — might produce the super-horizon correlations needed to seed cosmic 
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magnetic fields.2°? 25” More needs to be done in connecting the undoubted infla- 
tionary effect to post-inflationary cosmology. Another point of phenomenological 
contact might be inflationary baryogenesis.?°° More generally, we need to under- 
stand what happens after inflation to the ubiquitous factors of “H” which occur 
in one loop corrections to scalar effective potentials from fermions,?!® from gauge 


particles??? and from other scalars.?°? 


5.4. Back-Reaction 


I commented in Sec. 3.1 on the fine-tuning issues of single-scalar inflation. The 
worst of these is the problem of the cosmological constant.®®? Many people have 
suspected that quantum effects associated with a positive cosmological constant 
might lead to the cosmological constant screening itself. As far as I know, the first 
person to suggest this was Sasha Polyakov back in 1982.7 Variations of the same 
idea were put forward in the 1980s by Myhrvold,?6! Mottola,?®? Ford,??9 Antoniadis 
et al.2°3 Mazur and Mottola?®* and Antoniadis and Mottola.?©°:?66 

In 1992 Tsamis and I proposed a model of inflation based on three con- 


tentions?67,268. 


e The bare cosmological constant is not unreasonably small; 

e This triggered primordial inflation; and 

e Inflation was brought to an end by the gradual accumulation of self-gravitation 
between IR gravitons which are ripped out of the vacuum by the acceler- 
ated expansion and whose gravitational attraction is still holding the universe 
together. 


This is the idea that provoked the hostile referee report from which I quoted at 
the beginning of Sec. 4.1. It still amazes me that our model generates so much 
opposition because it is really a natural way to start inflation, and to make it 
last a long time without fine-tuning. Our proposal is also wonderfully economical, 
making a virtue out of the absence of any reason for the bare cosmological con- 
stant to be small, and using gravity to solve a gravitational problem with no new 
particles. I will devote the remainder of this section to discussing our model but 
one should take note of significant work on related proposals by Polyakov,2®° 2” 
by Mottola and collaborators,2”* 2®° by Brandenberger and collaborators,2°° 79? by 
Boyanovsky, de Vega and Sanchez,?°* 29° by Boyanovsky and Holman,?°° by Marolf 
and Morrison,?°" 2°° and by Akhmedov and collaborators.20° 3% 

Of course scepticism about my proposal with Tsamis is centered on the crucial 
third bullet point,?!° which we have so far been unable to prove but is at least not 
obviously wrong.?!! IR gravitons are certainly produced during inflation and it is 
difficult to understand why they would not attract one another, at least a little. 
It is also difficult to understand how that effect, which starts from zero, can avoid 
growing as more and more of the newly created gravitons come into contact with 
one another. Indeed, it is easy to show that as few as 10 e-foldings of inflation 
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produces enough IR gravitons to make the universe collapse if they were all in 
causal contact.3!2:313 

One reason we have not been able to demonstrate the third point is that it 
requires the finite part of a two loop computation in quantum gravity on de Sitter 
background. This might seem confusing after my statement in Sec. 3.3 that the 
primordial power spectra are tree order effects but there is not any contradiction. 
The diagram which represents how the tree order power spectra of Fig. 3 change 
the background geometry is obtained by attaching the two end points to a three- 
point vertex and connecting this to an external line, as in the leftmost diagram 
of Fig. 6. That one loop diagram gives the average gravitational response to the 
lowest order stress-energy of inflationary gravitons. It is constant and must actually 
be absorbed into a renormalization of the cosmological constant if the universe is 
really to begin inflating at the stated rate.??" (This is just like the renormalization 
condition of flat space quantum electrodynamics which makes the mass that appears 
in the electron propagator agree with the physical electron mass). The effect of self- 
gravitation between inflationary gravitons begins at the next order, for example, 
in the right-hand side diagram of Fig. 6. This diagram can show secular growth in 
the Schwinger—Keldysh formalism because the internal vertex points are integrated 
over the past light-cone of the external point, which grows as one waits to later and 
later times. 

Another reason we have not yet been able to prove the third point is that it 
requires an invariant quantification of the expansion rate. We actually performed 
a two loop computation of the graviton one-point function in a fixed gauge (but in 
D = 4, with a three-momentum cutoff that I do not trust) which does show slowing 
if we infer the expansion rate from the expectation value of the metric the same way 
one does from a classical metric.?41 Although the two loop computation consumed 
a year’s time (with a computer!), using it to get the expansion rate was simple 
because the expectation value of the metric in a homogeneous and isotropic state is 
itself homogeneous and isotropic, so the procedure is identical to the passage from 
expression (1) to (2). However, Unruh noted correctly that one cannot treat the 
expectation value of the metric as a metric.*!4 The correct procedure is to instead 
define an invariant operator which quantifies the expansion rate, and then compute 
its expectation value, just as I did for Green’s functions in my long-forgotten thesis 
work.!® So my career has come full circle! 


Fig. 6. One loop and two loop contributions to the graviton one-point function in pure quantum 
gravity. 
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The importance of Unruh’s observation was demonstrated by another pro- 
posal for significant back-reaction in scalar-driven inflation. Although the lowest 
order effect in pure gravity is at two loops, Mukhanov et al. realized in 1996 
that mixing with dynamical scalars affords scalar-driven inflation the opportu- 
nity of showing back-reaction at one loop order. Their initial work seemed to 
show such an effect.31°3!6 In 1998 Raul Abramo and I confirmed that their 
result appears as well in different gauges.?!%3!8 However, the effect was absent 
when we formulated an invariant operator to quantify the expansion rate and 
computed its expectation value in 2001.°193?! A better expansion invariant 
was proposed and computed in 2002 by Geshnizjani and Brandenberger, with 
the same result.?2? Subsequent refinements and extensions have been made by 
Morozzi and Vacca.323 326 

Pure quantum gravity lacks the scalar upon which the expansion observable 
of Geshnizjani and Brandenberger is based. However, Tsamis and I were eventu- 
ally able to devise what seems to be a reasonable substitute based on a nonlocal 
functional of the metric.?2” With Shun-Pei Miao we are currently engaged in com- 
puting its expectation value at one loop order. There should be no effect at one 
loop, but demonstrating that in a dimensionally regulated and fully renormalized 
computation is an important test of the observable. If it passes then we can begin 
the year-long labor involved in computing its expectation value at two loop order. 
Reaching that order will require a determination of the one loop corrections to the 
initial state. 

Although computing the expectation value of the expansion observable is a 
worthy goal, it will not end the controversy. The predicted time dependence takes 
the form,22%224 


A(t) = ih nGHs x 04 (“) x K ln | at) | + 0(G8 neteyh (186) 


3 oi a(t2) 


where Ho is the expansion rate of an initially empty universe released at time t = fo. 
Determining the value of K is expected to require a year. Assuming it is negative, we 
would have proven that the production of inflationary gravitons slows inflation by 
an amount which eventually becomes nonperturbatively strong. However, this only 
means that perturbation theory breaks down, not that inflation eventually stops. 
The unknown higher loop contributions might push the universe into deceleration, 
or they might sum up to produce only a small fractional decrease in the expansion 
rate. Both cases occur in scalar models.?!®22° 

Working out what happens requires some sort of nonperturbative resummation 
technique. For scalar potential models (with nondynamical gravity) Starobin- 
sky and Yokoyama showed how to sum the series of leading logarithms of the 
scale factor a(t).??4328329 This technique was successfully extended to a Yukawa- 


coupled scalar?!® and to scalar quantum electrodynamics,” but there is still no 
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generalization to quantum gravity.2°° One important piece of recent progress is the 
demonstration that a classical configuration of gravitational radiation can indeed 
hold the universe together for a least a little while.28° This is crucial because par- 
ticles created during inflation decohere so that their physics is essentially classical 
at late-times. If Tsamis and I are right, it must be a classical configuration of gray- 
itational radiation which is holding the universe together today. The existence of 
such a configuration also shows that inflationary particle production can actually 
stop inflation, if only it can produce the necessary configuration. 

There is absolutely no doubt that individual two loop diagrams make secular 
contributions to the expansion observable of the form (186). Sceptics believe that all 
such contributions will cancel when the many, many diagrams are summed to pro- 
duce a gauge invariant result. This certainly does not happen in scalar models which 
show the same sort of secular growth.?!®229 Sceptics are convinced that gravity 
will be different because the growth factors derive from long wavelength gravitons 
which are nearly constant, and exactly constant gravitons are gauge equivalent to 
zero. Of course the key distinction is between approximately constant and exactly 
constant. Sceptics insist that there can be no gravitational effects unless there is 
curvature, and effects must be small if the curvature is small. I take Lagrangians 
more seriously than pre-conceived opinions, and there is no doubt that matter (and 
the metric itself) couples to the metric, not to the curvature. Hence it must be 
possible, under certain circumstances, for these couplings to produce effects even 
in places and at times when the curvature of the original source has redshifted to 
nearly zero. 

The computation will decide who is right, but it is fitting to end this section by 
commenting on the similarity of this controversy to a classic dispute in the history of 
quantum electrodynamics. Electromagnetic Lagrangians show unambiguously that 
quantum matter couples to the electromagnetic vector potential rather than to the 
field strength, hence it must be possible for these couplings to produce effects even 
at times and in places where the field strength is small. However, physicists are 
as prone to prejudice as other humans and generations of theorists ignored what 
the theory was telling them and insisted that nothing can happen unless the field 
strength is nonzero. In 1949 Ehrenberg and Siday predicted**! what is better known 
today as the Aharonov-Bohm effect.*°? We all know how the experiment turned 
out. Iam confident the quantum gravitational reprise will have a similar outcome, 
but the important point for this paper is that posing such classic questions, and 
assembling the technology to answer them, is one more example of how quantum 
gravity is becoming a mature subject. 


6. Conclusions 


Another man described another revolutionary era in the following terms: Jt was 
the best of times, it was the worst of times. Fortunately, no one is likely to get his 
head chopped off as quantum gravity makes the difficult passage to maturity! It is 
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nonetheless a time of great change, and that affects different physicists in different 
ways. Some are delighted by the opportunity to participate in founding a new 
field; others want the changes to stop with the recognition of their own work; and 
some pretend that nothing is changing. The majority of those who call themselves 
quantum gravity researchers are in the last camp. To them the key problem of 
quantum gravity is resolving what happens to black holes in the final stages of 
evaporation, a process for which we have no data and are not likely to acquire data 
any time soon. Many of these people are my friends and I wish them well, but the 
last three decades of unrequited toil in string theory have proven that humans are 
not good at guessing fundamental theory without the guidance of data. 

Inflationary cosmology is already changing the way we think about quantum 
gravity. For example, neither discretization nor simply refusing to quantize the 
metric are any longer viable solutions to the divergence problem of quantum grav- 
ity.'4 Cosmology has required us to abandon in-out quantum field theory because 
the universe began with an initial singularity and no one knows how it will end, or 
even if it will end. And releasing the universe in a prepared state at a finite time 
implies we must stop dodging the issue of perturbative corrections to the initial 
state. 

Inflationary cosmology is also forcing us to re-think the issue of observables. For 
example, it used to be pronounced, with haughty dogmatism, that the S-matrix 
defines all that is observable about a quantum field theory. That statement was 
always dubious, but we now have counterexamples — which cannot be dismissed 
because they are being observed — in the form of the primordial power spectra 
discussed in Sec. 3. The recent controversy over the use of adiabatic regularization 
(see Sec. 3.4) makes it clear that we need to think more carefully about how to 
connect theory to observation even at tree order. The one loop corrections discussed 
in Sec. 4 — which may well be observable in a few decades — pose the further 
problem of how to define loop corrections to the power spectrum so that they are 
IR, finite, ultraviolet renormalizable, and so that they show the expected pattern 
of e-suppression. And we are just beginning to understand what other quantum 
gravitational effects (see Sec. 5) might be engendered by inflationary gravitons and 
scalars. 

Some people dismiss the impact of inflationary cosmology on quantum gravity 
because the detection of primordial gravitons is still tentative?”°2-8% 
fundamental theory has yet to provide a compelling model of primordial inflation. 
I believe these objections are purblind and transient. Within no more than a few 
years we will know whether or not BICEP2 has detected primordial gravitons.°° It 
is not so easy to forecast when fundamental theorists will bring forth a compelling 
model of inflation. The six fine-tuning problems I mentioned in Sec. 3.1 indicate 
that there is something very wrong with our current thinking. I suspect only a 
painful collision with data is going to straighten us out. Fortunately, the continuing 
flow of data makes that collision inevitable. When our observational colleagues have 
finally uncovered enough of the truth for us to guess the rest it will be possible to 


and because 
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replace the rough estimates (72) of the tree order power spectra with precise results, 
and also to compute loop corrections reliably, both for the power spectra and for 
other quantum gravitational effects. In the fullness of time all of these predictions 
will be tested by the data present in 21cm radiation. When that process has gone 
to completion perturbative quantum gravity will assume its place as a possession 
for the ages. 
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The discovery in the early 1970s that black holes radiate as black bodies has radically 
affected our understanding of general relativity, and offered us some early hints about 
the nature of quantum gravity. In this paper, I will review the discovery of black hole 
thermodynamics and summarize the many independent ways of obtaining the thermody- 
namic and (perhaps) statistical mechanical properties of black holes. I will then describe 
some of the remaining puzzles, including the nature of the quantum microstates, the 
problem of universality, and the information loss paradox. 


Keywords: Black hole thermodynamics; Hawking radiation; Bekenstein-Hawking ent- 
ropy. 


PACS Number(s): 04.70.Dy, 04.60.—m 


1. Introduction 


The surprising discovery that black holes behave as thermodynamic objects has 
radically affected our understanding of general relativity and its relationship to 
quantum field theory. In the early 1970s, Bekenstein'? and Hawking** showed 
that black holes radiate as black bodies, with characteristic temperatures and 
entropies 


ht Ahor 


kT = On? Seu = AAG’ (1) 


where « is the surface gravity and Apo, is the area of the horizon. These quanti- 


ties appear to be inherently quantum gravitational, in the sense that they depend 
on both Planck’s constant h and Newton’s constant G. The resulting black body 
radiation, Hawking radiation, has not yet been directly observed: The tempera- 
ture of an astrophysical black hole is on the order of a microkelvin, far lower than 
the cosmic microwave background temperature. But the Hawking temperature and 
the Bekenstein—Hawking entropy have been derived in so many independent ways, 
in different settings and with different assumptions, that it seems extraordinarily 
unlikely that they are not real. 

In ordinary thermodynamic systems, thermal properties reflect the statistical 
mechanics of underlying microstates. The temperature of a cup of tea is a measure 


T]-415 
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of the average energy of its molecules. Its entropy is a measure of the number of 
possible microscopic arrangements of those molecules. It seems natural to suppose 
the same to be true for black holes, in which case the temperature and entropy 
(1) might tell us something about the underlying quantum gravitational states. 
This idea has been used as a consistency check for a number of proposed models of 
quantum gravity, and has suggested important new directions for research. 

In one key aspect, though, black hole entropy is atypical. For an ordinary non- 
gravitational system, entropy is extensive, scaling as volume. Black hole entropy, on 
the contrary, is “holographic,” scaling as area. If the Bekenstein-Hawking entropy 
really counts black hole microstates, this holographic scaling suggests that a black 
hole has far fewer degrees of freedom than we might expect. But it has also been 
argued that a black hole provides an upper bound on the number of degrees of 
freedom in a given volume: If one tries to pack too many degrees of freedom into a 
region of space, they may inevitably collapse to form a black hole. This has led to 
the conjecture that all of nature may be fundamentally holographic, and that our 
usual counting of local degrees of freedom vastly overestimates their number. 

Black hole thermodynamics leads to a number of other interesting puzzles as 
well. One is the “problem of universality.” If the Bekenstein—Hawing entropy Spy 
counts the black hole degrees of freedom in an underlying quantum theory of gravity, 
one might expect that the result would depend on that theory. Oddly, though, 
the expression (1) can be obtained from a number of very different approaches to 
quantum gravity, from string theory to loop quantum gravity to induced gravity, in 
which the microscopic states appear to be quite different. This suggests the existence 
of an underlying structure shared by all of these approaches, but although we have 
some promising ideas, we do not know what that structure is. 

A second puzzle is the “information loss paradox.” Consider a black hole initially 
formed by the collapse of matter in a pure quantum state, which then evaporates 
completely into Hawking radiation. If the radiation is genuinely thermal, this would 
represent a transition from a pure state to a mixed state, a process that violates 
unitarity of evolution and is forbidden in ordinary quantum mechanics. If, on the 
other hand, the Hawking radiation is secretly a pure state, this would appear to 
require correlations between “early” and “late” Hawking particles that have never 
been in causal contact. The problem was recently sharpened by Almheiri et al.,° who 
argue that one must sacrifice some cherished principle: the equivalence principle, 
low energy effective field theory, or the nonexistence of high-entropy “remnants” 
at the end of black hole evaporation. These issues are currently areas of active 
research, and the matter remains in flux. 


2. Prehistory: Black Hole Mechanics and Wheeler’s Cup of Tea 


The prehistory of black hole thermodynamics might be traced back to Hawking’s 
1972 proof that the area of an event horizon can never decrease.® This prop- 
erty is reminiscent of the second law of thermodynamics, and the correspondence 
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was strengthened with the discovery of analogs of other laws of thermodynam- 
ics. This work culminated in the publication by Bardeen et al.’ of the “four laws 
of black hole mechanics”: For a stationary asymptotically flat black hole in four 
dimensions, uniquely characterized by a mass M, an angular momentum J, and a 
charge Q,® 


(0) The surface gravity « is constant over the event horizon. 


(1) For two stationary black holes differing only by small variations in the 
parameters M, J and Q 


5M = ——~6Ahor + Ny ST + Oy 6Q, (2) 
87G 
where (7; is the angular velocity and ®,, is the electric potential at the 
horizon. 
(2) The area of the event horizon of a black hole never decreases, 
dAhor = 0. (3) 


(3) It is impossible by any procedure to reduce the surface gravity « to zero 
in a finite number of steps. 


These laws are numbered in parallel with the four usual laws of thermodynamics, 
and they are clearly formally analogous, with « playing the role of temperature and 
Ahor of entropy. As in ordinary thermodynamics, there are a number of formulations 
of the third law, not all strictly equivalent. For a summary of the current status, 
see Ref. 8. While the four laws were originally formulated for four-dimensional 
“electrovac” spacetimes, they can be extended to more dimensions, more charges 
and angular momenta, and to other “black” objects such as black strings, rings, and 
branes. The first law, in particular, holds for arbitrary isolated horizons,?!? and for 
much more general gravitational actions, for which the entropy can be understood 
as a Noether charge.!? 

But despite the formal analogy with the laws of thermodynamics, Bardeen, 
Carter and Hawking argued that « cannot be a true temperature and Ajo, cannot 
be a true entropy. The defining characteristic of temperature, after all, is that heat 
flows from hot to cold. But if one places a classical black hole in contact with a 
heat reservoir of any temperature, energy will flow into the black hole, but never 
out. A classical black hole thus has a temperature of absolute zero. 

A second thread in the development of black hole thermodynamics began in 
1972, when John Wheeler asked his student, Jacob Bekenstein, what would happen 
if he dropped his cup of tea into a passing black hole.!? The initial state would be 


8See the Appendix for a brief summary of the relevant classical black hole physics. 
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a cup of tea (with a nonzero entropy) plus a back hole. The final state would be no 
tea, but a slightly larger black hole. Where did the entropy go? 

Bekenstein’s ultimate answer was that the black hole must have an entropy, 
which, he contended, must be proportional to its horizon area. Using a series of 
thought experiments,!? he argued further that the entropy ought to be of the form 


= 15 (4) 
where 77 is a constant of order one. For example, a spherical particle with a proper 
radius equal to its Compton wavelength will increase the entropy (4) by an amount 
of order one bit,? a result that is, remarkably, independent of the spin and charge 
of the black hole. A small harmonic oscillator dropped into a black hole will 
increase (4) by an amount at least as great as its entropy,” as will a beam of black 
body radiation.! Bekenstein’s “generalized second law of thermodynamics” — the 
claim that the total of ordinary entropy of matter plus black hole entropy Spy 
never decreases — has now been tested, and confirmed, over a very wide range of 
settings.8:19:14 


3. Hawking Radiation 


Despite the intriguing parallels between area and entropy, the fact that a clas- 
sical black hole had temperature JT’ = 0 seemed to present an insurmountable 
obstacle to identifying the two. Zel’dovich had suggested in 1970 that quantum 
effects might be relevant, causing spinning black holes to radiate in particular 
modes,!® but the arguments were not worked out in detail. In 1974, however, 
Hawking*+ used newly developed techniques for treating quantum field theory in 
curved spacetime!” to show that all black holes radiate, with a black body tem- 
perature 7}, With this identification of temperature, the first law of black hole 
mechanics (2) determines the entropy, and Bekenstein’s expression (4) is confirmed, 
with n = 1/4.> 

There are several ways to describe Hawking’s results. Perhaps the most intuitive 
is to say that quantum mechanics allows particles to tunnel out through the event 
horizon. But while Hawking himself used this as a heuristic picture,* the full math- 
ematical description of such a tunneling process was not worked out until much 
later.!8 Hawking’s calculation was based, rather, on a key feature of quantum field 
theory in curved spacetime: the fact that the vacuum is not unique, but depends 
on a choice of time. In particular, Hawking showed that the Minkowski vacuum of 
a past observer watching the collapse of a star differed from the vacuum of a future 
observer looking at the resulting black hole. 


bStrictly speaking, the first law determines black hole entropy only up to an additive constant. 
But Pretoruis et al. have designed an (idealized) process that reversibly forms a black hole with 
entropy Sgu, strongly hinting that no additive constant appears (Ref. 16). 
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3.1. Quantum field theory in curved spacetime 


To be more explicit, consider a free massless scalar field y. In Minkowski space, the 
standard approach to quantization!® is to first expand y in its Fourier modes 


y= (axfe(t,x) tah fi(t,x)) with fic = cee, we = [kl]. (5) 
k 
The fx are positive frequency, positive energy modes, which may be completely 
characterized by the conditions 
fic(t, x) = (); On fie (t, X) — —iwk f(t, x) (with Wk > 0). (6) 


Similarly, the fg are negative frequency, negative energy modes. The theory has a 
natural scalar product 


ena i n°(v1 dav) Vinee, (7) 


where © is an arbitrary Cauchy surface and n* is its unit normal. The modes (6) 
are then orthonormal, provided that we choose the normalization constants 


1 
Cor 


To quantize the theory, we now interpret the coefficients of the f, as annihilation 
operators, and the coefficients of the ff as creation operators. These obey the 
standard commutation relations, and can be used to construct the usual Fock space 
of free particle states. In particular, the vacuum is the state annihilated by the a, 


M.|0) = 0. (9) 


(8) 


In a more general spacetime, or even a noninertial frame in Minkowski space, the 
standard Fourier modes no longer exist. Suppose, though, that we can still choose a 
preferred time coordinate t. We can then find a complete set of orthonormal modes 
satisfying (6), perform an expansion of the form (5) with respect to these modes, 
and once again define creation and annihilation operators and a vacuum state. 

What is new, though, is that there may now be more than one preferred time. 
Given two choices, say t and t’, two expansions exist 


p= Sah + af) = Saif + a"). (10) 


7 


Furthermore, since the {f;, {7} are a complete set of functions, we can write 


fi => @nhit Biff) (11) 


tu 
for some coefficients aj; and 3;;. This relation is known as a Bogoliubov transforma- 
tion, and the coefficients aj; and 3;; are Bogoliubov coefficients.2° The coefficients 
may be read off from (11) by using the inner product (7). In particular 


By = —(Fi, Ff). (12) 
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We now have two vacua, a state |0) annihilated by the a; and a state |0’) 
annihilated by the a/, and two number operators, N; = ala; and N/ = aia u .. Using 
(11) and the arthononnality of the mode functions, it may be easily dhadked that 


(0'|.N;|0’) =o |Byal?. (13) 


Thus, if the coefficients @;; are not all zero, the “primed” vacuum will have a 
nonvanishing “unprimed” particle content. 


3.2. Hawking’s calculation 


In his seminal work on black hole radiation and evaporation,?4 Hawking computed 
the Bogoliubov coefficients between an initial vacuum outside a collapsing star anda 
final vacuum after the formation of a black hole. He showed that a “primed” observer 
in the distant future would see a thermal distribution of particles at the Hawking 
temperature (1). While Hawking’s full calculation is too technical to include in this 
review — see, for example, Traschen’s very nice paper?! for details — it is possible 
to sketch out the basic argument. 

Hawking’s starting point is the collapse of a star to form a black hole, as shown in 
the Carter—Penrose diagram of Fig. 1(a). The shaded area is the collapsing star. The 
horizon 1 forms at some time after the collapse has begun. In the distant past, the 
spacetime far from the star is nearly flat, and we can define a standard Minkowski 
vacuum at past null infinity Z~. In the distant future, the black hole has settled 
down to a (nearly) stationary configuration, and we can define a vacuum at future 
null infinity Z*. From the previous discussion, we need to find the orthonormal 
modes at J+, propagate them to a common Cauchy surface — © in Fig. 1(b) — 
and take the inner product (12) to determine the Bogoliubov coefficients. 


Fig. 1. The setup for Hawking’s calculation. (a) Carter—Penrose diagram for a star collapsing to 
form a black hole. (b) Coordinates, ray-tracing, and mode functions. 
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For the first step, let us choose null coordinates u (“retarded time”) and v 
(“advanced time”) in the exterior of the black hole, chosen so that u is an affine 
parameter along Z* and v is an affine parameter along Z—, as shown in Fig. 1(b). It 
is easy to check that the affine condition implies that near 7+, the metric takes the 
form ds? = dudv + angular terms. As a result, the Klein—-Gordon equation greatly 
simplifies, and the asymptotic modes become 


fuem ~ Vim (8, dye” near Z-, 
: (14) 
fan ~Yim(9, dye" near I+. 

To calculate the inner product (12), Hawking evolved the mode f/,,,,, from Z* 
backwards to Z~. In principle, this requires solving the Klein—Gordon equation 
with boundary conditions at Z*. For the relevant frequencies, though, the geometric 
optics approximation, in which massless particles move along null geodesics, is good 
enough. The dotted line labeled y in Fig. 1(b) shows a null geodesic propagating 
backwards from Z*, “reflecting” off r = 0, and continuing to Z~. (One may visualize 
this as an ingoing spherical wave shrinking to a point at the origin, passing through 
itself, and expanding outward.) This geodesic maps a point u on Zt to a point p(w) 
on Z~, and the mode functions f’,,,,, in turn map to functions of the form 


~Yom(0,p)eP . (15) 


The meat of Hawking’s calculation is the determination of this function p(w), 
which he finds by tracking the position of the geodesic ¥y relative to the horizon H. 
The result is that 


piu) ~ A— Be™, (16) 


where « is the surface gravity and A and B are constants. The exponential depen- 
dence on uw reflects the extreme blue shift of waves passing near the horizon: Two 
successive crests at Z+, labeled by u and u+ du, will be mapped to a separation 
p'(ujou ~ KBe~*“du at I~. Viewed in the opposite direction, the function p(u) 
describes the “peeling” of null geodesics away from the horizon.?? 

The computation of the Bogoliubov coefficients is now straightforward. The 
inner product (12) on a Cauchy surface © near Z~ now involves integrals of the 


form 
iw'v ,—i¥ Inv 
dvew Veta”, 


These yield gamma functions of complex arguments, whose absolute squares give 
the exponential behavior of a thermal distribution with the Hawking temperature. 

I have left out many steps, of course. In particular, future null infinity Z* is not 
a Cauchy surface, so the modes f’,,,,, in (14) are not a complete set. They must be 


TI-422 S. Carlip 


supplemented by modes on the horizon H. These account for backscattering into 
the black hole. This scattering is energy-dependent, and distorts the black body 
spectrum, yielding instead a “grey body spectrum” 


(N) = at (17) 


Qrw 


oe ad. 
where the reflection coefficient I',, can be calculated explicitly.?*:24 Generalizations 
to particles other than massless scalars are straightforward, and give a consistent 
thermodynamic picture. 


4. Back-of-the-Envelope Estimates 


The description I have given of Hawking’s derivation of black hole radiation is 
not, for most people, terribly intuitive. It is useful to understand some back-of- 
the-envelope estimates, which can give a better feel for the physics. The following 
examples, illustrated in Figs. 2 and 3, are taken from a set of my earlier lectures,”° 


slightly updated. 


4.1. Entropy 


We start with an estimate of black hole entropy, using the generalized second law.?° 
Consider a cubic box of gas with volume ¢°, mass m, and temperature TJ’, dropped 
into a Schwarzschild black hole of mass M (and therefore having a horizon area 
of A = 167G?M7). Let us assume that the box is at least as large as the thermal 
wavelength of the gas,  ~ h/T. Then the descent of the gas into the black hole will 
lead to a decrease of entropy 


m me 
ASeas ¥ —= Y - 18 
gas F h ( ) 
The box of gas will coalesce with the black hole when its proper distance p from 


the horizon is of order ¢. For a Schwarzschild black hole, this proper distance is 


2GM-+or dr 
p= Ss JMG, 
2 


GM /1—2GM 
ie 
ie 


box of gas 7 


mass m, temperature T 
black hole 


mass M 


Fig. 2. A thought experiment for Bekenstein—Hawking entropy. 
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virtual 


pair horizon radius ~ 2GM/c? 


Fig. 3. A thought experiment for Hawking temperature. 


so p ~ £ when or ~ ¢?/GM. The gas initially has mass m, but its energy as seen 
from infinity is redshifted as the box falls toward the black hole. When the box 
reaches r = 2GM + ér, the black hole will gain a mass 


2GM me 
AM mA Soar op GM 


The change in the horizon area is thus 


AA~ G?MAM ~ Gmé. (19) 
Comparing (18) and (19), we see that the black hole must gain an entropy 
AA 
ASgu © —ASgas © ck (20) 


4.2. Temperature 


We next estimate the Hawking temperature, at a similar level of hand-waving.?” 
As in Sec. 3, we consider inequivalent vacua, but following a heuristic approach 
suggested by Hawking,* we now compare vacua inside and outside the horizon. 
As usual in quantum field theory, let us view the vacuum as a sea of virtual 
pairs of particles, each pair consisting of a particle with positive energy E relative 
to an observer at infinity and a particle with negative energy —E. Normally, by the 
uncertainty principle, a negative energy particle can exist only for a time t ~ h/|F]. 
Recall, however, that the distinction between “positive” and “negative” energy can 
depend on the choice of time. For a Schwarzschild black hole, with a metric 


=] 
ds? = (1 am) dt? (1 am) dr* — 77 d0?, (21) 
r 


r 


the coordinate t is a time coordinate in the exterior, but a spatial coordinate in 
the interior, where the coefficient of dt? changes sign.° Similarly, the coordinate 


°Strictly speaking, the coordinates labeled r and t outside the horizon are not the same as those 
inside, since the two coordinate patches do not overlap. The argument can be rephrased in terms 
of proper times of infalling observers, though, in a way that avoids this issue. 


TI-424 S. Carlip 


r is a spatial coordinate in the exterior, but a time coordinate in the interior, 
where “forward in time” means “toward the center of the black hole.” A particle 
with negative energy relative to an exterior observer may thus have positive energy 
relative to an interior observer, and if it crosses the horizon quickly enough, this 
will allow us to evade the uncertainty principle. 

Now consider a virtual pair of massless particles momentarily at rest at a coor- 
dinate distance dr from the horizon. As in Sec. 4.1, the proper distance — and 
therefore the proper time for the negative energy partner to reach the horizon — 
will be 


T~VGMor. 


Setting this equal to the lifetime A/F of the pair, we find that 


h 
VGMor’ 
which should match the energy of the escaping positive-energy partner. This is the 


energy at 2GM + dr, though. As in the preceding section, the energy at infinity 
will be redshifted, becoming 


|B 


‘ h IGM h (22) 
© J/GMoir 29GM +6r GM’ 


independent of the initial position dr. We thus expect a black hole to radiate with 
a characteristic temperature kT ~ h/GM, matching the Hawking temperature (1). 


5. The Many Derivations of Black Hole Thermodynamics 


Hawking’s derivation of the black hole temperature seems to depend only on simple 
properties of quantum field theory in curved spacetime. Still, one may worry that 
these properties have been pushed beyond their realm of validity. In particular, it 
is evident from Sec. 3.2 that the calculation of the Bogoliubov coefficients requires 
that we follow outgoing modes backwards in time to a region in which they are very 
highly blueshifted, with energies far above the Planck scale.?® °° 

This “trans-Planckian problem” has been addressed in a number of ways: by 
imposing high-energy cut-offs,?9 by restricting fields to discrete lattices,*! by alter- 
ing dispersion relations to break the connection between high frequencies and high 
energies,??-3° and by considering analog systems such as Unruh’s “dumb holes,” fluid 
flows that generate sonic event horizons.?° It turns out to be very hard to “break” 
Hawking’s results: Standard thermal Hawking radiation, with the usual Hawking 
temperature, appears even in models that never involve Planck scale physics.°4 

Perhaps the strongest evidence for the reality of black hole thermodynamics, 
though, comes from the number of independent derivations, each relying on different 
assumptions and approximations. In this section, I will briefly summarize some of 
these results. 
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5.1. Other settings 


Hawking’s original derivation involved a particular physical setting, a star collapsing 
to form a black hole. This determined the choice of vacuum state, and one could 
worry that the result might be too narrow. It has been subsequently shown, though, 
that one can generalize the derivation considerably, for instance by replacing the 
initial Minkowski vacuum in the distant past with the vacuum for a freely falling 
observer near the horizon.®° Indeed, there is now fairly strong evidence that only 
a few fundamental features are needed to obtain thermal radiation: a vacuum near 
the horizon as seen by a freely falling observer, vacuum fluctuations that start in the 
ground state, and subsequent adiabatic evolution.**°° In particular, the Einstein 
field equations are not required.?” 

We may also ask which ingredients in the derivation of Sec. 3.2 were essential 
to the result.2? Consider any setting in which, as in Fig. 1(b), one can trace null 
geodesics (such as y in the figure) from part of Z* to part of Z—, defining a map p(w). 
One can then define a time-dependent “surface gravity” «(u) by 

W 
K(u) = ea) 
pi(u) 
Now pick a point wu, on Z+. Barcelé et al. show?? that an observer near u, will 
observe nearly thermal Hawking radiation, with a time-dependent temperature 
h«(ux)/27, provided an adiabatic condition 
Ki! (tx) < k(t)? (24) 


holds. In particular, it is not even necessary for an event horizon to form. The 


(23) 


exponential “peeling” (16) is sufficient. 

One can also analyze an eternal black hole at equilibrium with its Hawking radi- 
ation,®® obtaining consistent results. Moreover, the computation may be extended 
beyond the number operator (13) to find an expression for the full final state in 
terms of the initial vacuum. It is exactly thermal, at least in the approximation 
that one ignores back-reaction of Hawking radiation on the black hole.3%»*° 


5.2. Unruh radiation 


By the principle of equivalence, the gravitational field near a black hole horizon 
should be locally equivalent to uniform acceleration in a flat spacetime. Hawking 
radiation is not entirely local, so one might not expect an exact equivalence, but 
one should be able to test Hawking’s results by comparing the vacua of an inertial 
observer and a uniformly accelerating observer in Minkowski space. This was first 
done by Unruh.*° The Bogoliubov coefficients can be derived fairly easily, and the 
conclusion is that the accelerated observer with a constant proper acceleration a 
sees a thermal flux of particles with a temperature 
_ ha 

ae 


in almost exact analogy with the Hawking temperature (1). 


iG (25) 


TI-426 S. Carlip 


Coincidentally, at almost the same time as Unruh’s work, Bisognano and Wich- 
mann gave an independent, more difficult but mathematically rigorous proof of the 
Unruh effect in quantum field theory.*! Their results are quite general. While they 
require flat spacetime, the quantum field theory need not be a free theory. 


5.3. Particle detectors 


Hawking’s derivation and similar approaches to black hole thermodynamics depend 
heavily on the standard quantum field theoretical definitions of vacuum states, Fock 
space and particle number. But since one of the main conclusions is that states 
in curved spacetime are rather different from those we are used to in Minkowski 
space — for instance, the vacuum is no longer unique — we might worry that we are 
overinterpreting the formalism. True physical observables are notoriously difficult 
to find in quantum gravity, so this is not a trivial concern. 

To investigate this question, Unruh?° and DeWitt?” developed simple models 
of particle detectors in black hole backgrounds. They showed that such detectors 
do, in fact, see thermal radiation at the Hawking temperature. Similarly, Yu and 
Zhou*? have shown that a two-level atom outside a black hole will spontaneously 
excite as if it were in a thermal bath at the Hawking temperature. 


5.4. Tunneling 


Hawking radiation is a quantum process, and we may try to apply our intuition 
about quantum mechanics to seek a deeper understanding. As noted in Sec. 3, 
an obvious guess is that while escape from a black hole is classically forbidden, 
quantum particles might tunnel out. For instance, instead of visualizing virtual 
pairs forming outside the horizon as in Sec. 4.2, we might reverse the picture and 
imagine pairs forming inside the horizon, with one member then tunneling out. 

While the idea of a tunneling description can be traced to Damour and Ruffini** 
and Srinivasan and Padmanabhan,*° its modern incarnation owes itself to a key 
insight of Parikh and Wilczek,!® who realized that rather than thinking of a particle 
as tunneling through the horizon, one could think of the horizon as tunneling past 
the particle. Consider an initial Schwarzschild black hole of mass M that emits a 
particle in an s-wave state, represented as a spherical shell of mass w < M. In the 
WKB approximation, the tunneling rate is 


Pe en2imI/h (26) 


where J is the action for the outgoing shell, moving in a background metric of a 
black hole as it crosses the horizon from r;, to r,,,. The exponent, in turn, can be 
written as 

T out M-—w Touk dr 


Tout Pr 
Im J = Im p,dr = im [ dp}. dr =Im —dH, (27) 
Tin 0 M Tin r 


Tin 
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where I have used Hamilton’s equations of motion to write dp, = dH/r and noted 
that the horizon moves inward from GM to G(M — w) as the particle is emitted. 

To evaluate this integral, Parikh and Wilczek use the Painlevé—Gullstrand form 
of the black hole metric 


ds? = (1 =) ap [2 ee ga ah 


r i 


which avoids coordinate singularities at the horizon. Outgoing radial null geodesics 
then satisfy 
2G(M — 
as eat =e) 
- 
and the integral (27) can be evaluated by a standard contour deformation, yielding 
a tunneling rate 


T= e 87 G(M—$)/h _ eA Sx | (29) 


where AS}, is the change in the Bekenstein-Hawking entropy (1) due to the emis- 
sion of the shell. By the first law of thermodynamics, AS3y = —hw/T, and we 
recover the expected emission rate for thermal Hawking radiation. 

The tunneling approach was initially developed for the simplest black holes, but 
has by now been vastly extended, to apply to virtually every known black hole 
solution, including even nonstationary configurations.4® The method has also been 
generalized. In particular, one can replace the assumption that outgoing particles 
follow null geodesics by the more general approximation that the action I satisfies 
a Hamilton-Jacobi equation.*7 


5.5. Hawking radiation from anomalies 


From early on, it was clear that black hole thermodynamics should be visible in the 
behavior of the stress-energy tensor near a horizon.4?4® The computation of the 
expectation value (Ty) is a complicated and technical subject, beyond the scope 
of this paper (see for instance, the book by Birrell and Davies!® for a pedagogi- 
cal introduction). The results, however, are consistent with Hawking’s discovery. 
In particular, the heuristic approach of Sec. 4.2 is supported: An ingoing flux of 
negative energy near the horizon balances the outgoing flux of positive energy at 
infinity. Energy is conserved, and as in the tunneling derivation of Sec. 5.4, the 
back-reaction of Hawking radiation reduces the mass of the black hole. 

In general, the computation of the expectation value (T,») requires a mixture 
of analytic approximations and numerical methods. In one special case, though, 
the situation becomes drastically simpler. Consider a conformally invariant field 
(for instance, a massless scalar) in two spacetime dimensions. Classically, confor- 
mal invariance forces the trace T“, of the stress—energy tensor to vanish, and its 
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quantum correction, the “trace anomaly,” is completely determined as 
ro ee 
(T a 7 aan 

where c, the central charge, is fully determined by the properties of the field in a 
flat background.*® Christensen and Fulling®® realized in 1977 that in such a case, 
the full stress—energy tensor is completely determined from (T“,) by conservation 
laws and boundary conditions. The result exactly matched expectations, with fluxes 
of positive energy at infinity, describing thermal radiation, balanced by fluxes of 
negative energy through the horizon. 

Christensen and Fulling’s original analysis relied crucially on the restriction to 
two dimensions. More recently, though Wilczek and collaborators?! °? have shown 
how to generalize the argument to a much wider setting. The central idea is that 
one can impose an effective description of a vacuum state near the horizon, the 
equivalent of the freely falling observer’s vacuum, by excluding matter fields that 
correspond to outgoing (“horizon-skimming”) modes. This restriction makes the 
field theory chiral, though, and a chiral theory contains a new anomaly, a diffeo- 
morphism anomaly. By analyzing the resulting theory one partial wave at a time, 
one can again use this anomaly to reconstruct the stress—energy tensor, recovering 
the characteristic thermal radiation. This method is actually closely related to that 
of Christensen and Fulling — in two dimensions, one can always trade a conformal 
anomaly for a diffeomorphism anomaly by adding appropriate counterterms — but 
the newer approach seems more suitable for treating separate partial waves, and 
thus for application to more than two dimensions. 

In a remarkable piece of work, Iso et al.°4 and Bonora et al.°°°® have extended 
this approach to obtain a much more detailed picture of Hawking radiation. They 
show that if one considers not just the stress—energy tensor, but higher spin currents 
as well, one can recover the full thermal spectrum. Hawking radiation can thus be 
obtained, with only fairly minimal assumptions, from the symmetries of the near- 
horizon region of a black hole. 


5.6. Periodic Greens functions 


In ordinary quantum field theory, thermal Greens functions have a hidden period- 
icity. Consider, for instance, the Greens function of a scalar field y in a thermal 
ensemble at temperature T = 1/6 


Galalsa’a) = Tre?" Oy") =Tre ole, Oe Fe" ofa" ,2)) 
= Tr(o(a, 0)e Oo" ef p(x’, the O*) = Tr(y(a, O)e FO" y(n’, t + 48)) 
= Gg(a’,t +18; 2,0), (30) 


where I have used cyclicity of the trace and the fact that the Hamiltonian generates 
time translations, so e°" y(a’, t)e~°" = v(x’, t+i8). In particular, a thermal Greens 
function that is symmetric in its arguments must be periodic in imaginary time, 
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with period i(@. Conversely, in axiomatic quantum field theory such periodicity, 
formalized as the KMS condition,>” °° serves to define a thermal state. 

For a uniformly accelerating observer in Minkowski space, it was shown in 1976 
that the Greens function has just such a periodicity,*! with a temperature equal to 
the Unruh temperature (25). By the arguments of Sec. 5.2, one should expect an 
analogous result for a stationary observer near a black hole horizon. This is indeed 
the case, and ( is just the inverse of the Hawking temperature (1). Moreover, 
the response of a particle detector of the sort discussed in Sec. 5.3 is determined by 
a closely related Greens function, and the periodicity ensures quite generally that 
such a detector will see a thermal bath at temperature Ty. 


5.7. Gravitational partition function 


One may ask whether the periodicity properties of the Greens functions may be 
extended to the gravitational field itself. For ordinary quantum mechanical systems, 
it is well known that a thermal partition function may be obtained from the usual 
path integral by analytically continuing to periodic imaginary time with period 3.% 
The meaning of such a continuation is not so clear in general relativity, though: 
There is usually no preferred time coordinate to make imaginary, and there is 
no simple relationship between solutions of the field equations with “real time” 
(Lorentzian signature) and “imaginary time” (Riemannian signature). 

In one setting, though, there is a natural choice. If a spacetime is stationary — 
that is, if it admits a timelike Killing vector — the Killing vector defines a preferred 
time coordinate. In particular, for a nonextremal stationary black hole, the generic 
metric near enough to the horizon takes the form 


1 
2K(r — r+) 


ds? = 2x(r — r,)dt? dr? — r7.dQ?, 
where r+ is the location of the horizon and « is again the surface gravity. Setting 
¢t =i7 and replacing r by the proper distance to the horizon 


1 
p= Vir =), 


we obtain the “Euclidean black hole” metric 
ds? = dp? + ?p"dr? + r3.dQ?. (31) 


We can immediately recognize the first two terms of (31) as the metric of a flat 
plane in polar coordinates. The horizon r = r+ has collapsed to a point p = 0: The 
“Euclidean section” includes only the black hole exterior. But the p—7 plane is flat 
only if kr has a period 27. Otherwise, the origin is a conical singularity, and the 
metric will not extremize the action at that point. This periodicity, in turn, requires 
that 7 have a period 27/« = 1/Ty, exactly as in the preceding section. 

We now see that the periodicity of Greens functions did not depend on details 
of quantum fields, but arose directly from the geometry. But we can do more: as 
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Gibbons and Hawking argued,®* we can use this result to obtain a saddle point 
approximation to the gravitational path integral. 

By analogy with ordinary quantum field theory, the Euclidean path integral for 
the gravitational partition function can be formally written as 


z(6| = / [dgle™=, (32) 


where Ip,. is the “Euclidean” Einstein—Hilbert action — that is, the Einstein— 
Hilbert action for metrics of Riemannian signature — and the integral is over all 
metrics that are, in some sense, periodic in “Euclidean” time with period 3. This 
is, of course, an ill-defined expression: General relativity is nonrenormalizable, so 
even if we knew exactly what “periodic” meant here, we would not know how to 
make sense of the functional integral. Still, though, there ought to be some sense in 
which (32) has a meaning in effective field theory,®? and even if the full expression is 
ill-defined, a saddle point approximation could give a physically reasonable result. 

Naively, the saddle point contributions (31) gives a vanishing contribution to 
Touc: the “bulk” Einstein—Hilbert action 


1 4... /i-y 
male ey love 


is zero for any classical solution of the vacuum field equations. The key observation 
of Gibbons and Hawking® was that on a manifold with boundary, the “bulk” action 
must be supplemented by a boundary term, without which the action will typically 
have no true extrema.® The boundary was originally taken to be at infinity, but 
subsequent work has shown that it may also be placed at the horizon.®* © The 
resulting boundary term may be evaluated in a number of ways: For instance, if 
one dimensionally reduces the action to the p-r plane, the problem becomes purely 
topological.®° The final result is that 


Tone = oe B(M + QJ + 6Q). (33) 
The saddle point contribution e/®«° to the partition function (32) may then be 
recognized as the grand canonical partition function for a system with inverse tem- 
perature 3 = 27/« and entropy Spy = Anor/4hG, just as expected. 

The same analysis applies to much more general stationary geometries.©? Just 
as above, Killing horizons in the Lorentzian configurations translate into zeros of 
the Killing vector in the Riemannian continuation, and boundary terms resembling 
(33) again appear. Recently Neiman has argued that even in Lorentzian signature, 
the usual Gibbons—Hawking boundary term acquires an imaginary piece whenever 


the boundary involves “signature flips,” points at which the signature of the bound- 
ary changes from spacelike to timelike.” For black hole spacetimes, the resulting 
boundary term looks very much like the one in the Euclidean action. 
Alternatively, one may try to evade the ambiguities in “imaginary time” 
by starting with the Hamiltonian formulation of general relativity.°’ Perhaps 
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unsurprisingly, the same boundary term (33) appears. In canonical quantization, 
this boundary term gives rise to a new term in the Wheeler—-DeWitt equation, from 
which one can again recover the Bekenstein-Hawking entropy.’! 


5.8. Pair production of black holes 


As Heisenberg and Euler noticed in 1936," the quantum vacuum is unstable in the 
presence of a strong electric field. Virtual pairs of charged particles can be “pulled 
out of the vacuum” by the field, becoming physical particle-antiparticle pairs. The 
pair production rate per unit volume for a fermion of mass m and charge e in a 
constant field € was worked out in detail by Schwinger,” and takes the form 
Wr me BE? pre 
=) 

For NV identical species of fermions, this expression would be multiplied by NV. 

Now consider pair production of charged black holes in an electric field. Without 
a full quantum theory of gravity we cannot compute an exact production rate, but 
as in the preceding section, we can use the Euclidean path integral to obtain a 
semiclassical approximation. If the Bekenstein entropy is a statistical mechanical 
entropy, we would expect a black hole to have exp{.S} microscopic states, with a 
corresponding enhancement of the pair production rate by NV = exp{ Spy}. 

This is exactly what is found. The first computation, by Garfinkle et al.”4 con- 
sidered magnetically charged black holes, and compared the pair production rate 
to the rate for magnetic monopoles of the same mass. Subsequent work extended 
these results to a much wider variety of black holes.”>“" In every known case, pair 
production is enhanced just as one would expect if the Bekenstein—Hawking entropy 
represented an actual counting of states. 


5.9. Quantum field theory and the eternal black hole 


Hawking’s derivation of black hole thermodynamics was based on a collapsing star, 
with a Minkowski-like vacuum in the distant past. But we can also write down 
solutions of the Einstein field equations that represent eternal black holes. 

Recall that the maximally extended eternal black hole spacetime has a Carter— 
Penrose diagram with four regions, as shown in Fig. 4. Now consider a quantum 
state WV defined on a Cauchy surface © passing through the black hole bifurcation 
sphere, as shown in the figure. No information can pass from region II to region I, 
so even if U is a pure state, an observer in region I will see only a density matrix, 
obtained by tracing over region IT. This at least opens up the possibility that physics 
for an observer in region I will be thermal. 

Indeed, for a free quantum field there is at most one quantum state, the Hartle- 
Hawking vacuum state, that is regular everywhere on the horizon.”*”? For a scalar 
field, a direct computation shows that the density matrix obtained by tracing over 
region II is thermal, with a temperature T;,.°° For more general fields, the same can 
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Fig. 4. A Carter—Penrose diagram for an eternal black hole. 


be shown by means of fairly sophisticated quantum field theoretical arguments’*”? 


or by an analysis of the path integral.8! Thus, even if matter is secretly in a pure 
state, the eternal black hole is effectively also a thermal system. 


5.10. Quantized gravity and classical matter 


Although they differ in detail, the derivations of Hawking radiation so far have all 
analyzed quantum matter in a classical black hole background. This is a reasonable 
approximation, except perhaps for the “trans-Planckian problem” discussed above. 
In one special case, though, the direction can be reversed: Hawking radiation can 
be derived from classical matter in the presence of a quantum black hole. 

This special case is that of the BTZ black hole,®* a black hole in (2 + 1)- 
dimensional asymptotically anti-de Sitter space. This is a rather peculiar solution: 
The vacuum Einstein field equations in 2+ 1 dimensions imply that spacetime has 
constant negative curvature, and, indeed, the BTZ black hole can be described as 
a region of anti-de Sitter space with certain identifications.*? Nevertheless, it is a 
genuine black hole, the final state of collapsing matter, with a normal event horizon 
and (in the rotating case) a normal inner Cauchy horizon.84 

The BTZ black hole presents a puzzle for any statistical mechanical explana- 
tion of black hole entropy. As a corollary to the constant curvature required by 
the field equations, (2+1)-dimensional gravity contains no propagating degrees of 
freedom that could serve as candidates for microscopic states. As first suggested by 
Carlip®® and confirmed in detail by Strominger®” and, independently, Birmingham 
et al.,°8 the natural candidate for the microscopic degrees of freedom are “edge 
states,” additional degrees of freedom at infinity that appear because the boundary 
conditions break diffeomorphism invariance. 

I will discuss these states further in Sec. 9. For now, the key feature is that 
they allow an “effective” description of the quantum BTZ black hole in terms of a 
particular two-dimensional conformal field theory at infinity.8° Emparan and Sachs 
have shown how to couple this quantum theory to classical matter,°° and to use 
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the coupling to calculate transition amplitudes between quantum black hole states 
induced by interactions with this “background” matter. Detailed balance argu- 
ments then allow them to determine the emission rate of the matter fields, obtain- 
ing the correct Hawking radiation spectrum, even including the correct greybody 
factors. 


5.11. Other approaches 


I have tried to give an overview of the most common approaches to black hole 
thermodynamics, but I have necessarily omitted a few. For example 


e Instead of considering only the region outside the horizon, one may choose a 
time-slicing that crosses the horizon and continues to the singularity. A black hole 
interior is not static — the Killing vector that gives time translation invariance in 
the exterior becomes spacelike in the interior — and the Hamiltonian in horizon- 
crossing coordinates is therefore time-dependent. Nevertheless, one can perform a 
Heisenberg picture quantization to obtain time-dependent states which, although 
pure, behave “thermally” in the sense that they excite radiation detectors and 
have a net flux of radiation at infinity that matches Hawking’s results.9! 

e One may repeat the computation of Unruh radiation in the functional Schrédinger 
picture, in which the quantum state is a wave functional of the field configura- 
tion. The results are again what would be expected®?: The ground states for 
the stationary and accelerated observers are different, and the difference appears 
as a thermal distribution of “Minkowski” particles in the accelerated observer’s 
state. 

e York has proposed a model in which fluctuations of the event horizon lead to a 
“quantum ergosphere” through which particles can tunnel.9? The picture is too 
complicated to compute an exact Hawking temperature, but by restricting the 
horizon fluctuations to those corresponding to the lowest quasinormal mode, he 
obtains a value for the temperature that is accurate to to within about 2%. 


The derivations in this section are not entirely independent. For instance, the 
response function of a particle detector (Sec. 5.3) depends on a Greens function, 
and it is the periodicity of this function (Sec. 5.6) that determines the thermal 
response. This periodicity, in turn, may be traced the periodicity of the black hole 
metric in imaginary time (Sec. 5.7). The tunneling approach (Sec. 5.4) and the 
original analysis by Hawking both lead to relations of the form 


P(emis) = e~°” P(absor) 


between emission and absorption probabilities, and both results come from the 
behavior of complex functions near the horizon. Still, the derivations are sufficiently 
independent, with different enough assumptions and approximations, that together 
they provide very powerful evidence for the reality of Hawking radiation. 
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6. Thermodynamic Properties of Black Holes 


There is more to thermodynamics than temperature and entropy. If black holes 
are thermal objects, we should be able to apply the rest of thermodynamic theory, 
from Carnot cycles to phase changes, to systems containing black holes. In this 
section, after a short discussion of black hole evaporation, I will review four selected 
aspects: heat capacity, phase transitions, thermodynamic volume, and an argument 
that black hole thermodynamics prohibits certain violations of Lorentz invariance. 


6.1. Black hole evaporation 


We have seen that a black hole radiates as a grey body, at a temperature Ty. By 
the Stefan—Boltzmann law, it will therefore lose mass at a rate 

dM 

dt 
where € is some averaged emissivity, summed over the species of particles that can 
be radiated. For the Schwarzschild black hole, Ty ~ 1/M and A,,, ~ M?, so the 
lifetime goes as M*. To determine the exact coefficients one must treat the greybody 
factors carefully.2? The result is 


er hee (34) 


where Mo is the mass of the Sun. For charged or rotating black holes, evaporation is 
more complicated: Black holes can “discharge” via Schwinger pair production, but 
the Hawking temperature decreases with increasing charge and angular momentum. 


6.2. Heat capacity 


A Schwarzschild black hole has a temperature T = /87M. As an isolated black hole 
evaporates, it radiates energy, its mass decreases, and its temperature consequently 
increases. That is, a Schwarzschild black hole has a negative heat capacity 


Os 1 
C= Tar = Ea (35) 
For a rotating black hole, the situation is more complicated: As Davies has shown, 
the heat capacity at fixed angular velocity is always negative, but the heat capacity 
at fixed angular momentum is negative for low J, diverges as a critical value, and 
then becomes positive, a behavior indicative of a phase transition. 

While negative heat capacity is unusual, it is not a peculiar feature of black 
holes, but holds even for Newtonian self-gravitating systems. As a satellite in low 
Earth orbit loses energy to atmospheric friction, its orbit drops to a lower altitude, 
increasing its kinetic energy. For gravitationally bound stellar systems, Lynden- 
Bell has analyzed this behavior, which he calls it the “gravothermal catastrophe,” in 
detail.9° In Newtonian gravity, the runaway behavior is a consequence of the purely 
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attractive, long range nature of the gravitational interaction, and it stops when 
gravity is no longer the principle interaction governing the system. For a relativistic 
black hole, on the other hand, there seems to be no competing interaction, at least 
until the evaporating black hole has shrunk to the Planck scale. 

There are two standard (theoretical) ways to stabilize a black hole. The first 
is to place it in a reflecting cavity, allowing it to come into equilibrium with its 
Hawking radiation. For a spherical cavity of area A = 4mr%, the heat capacity for 
a Schwarzschild black hole becomes?” 


Ca = 81M? (1 _ — (= - 1) , (36) 


TB 'B 


which is positive for a small enough cavity, rg < 3M. The second is to consider a 
black hole in asymptotically anti-de Sitter space,9% in which the geometry serves 
as a sort of “cavity.” For a given temperature, one finds two possible anti-de Sitter 
black hole configurations, a “small” black hole with negative heat capacity and a 
“large” black hole with positive heat capacity. 


6.3. Phase transitions 


As already indicated, black holes can have a complicated phase structure, including 
phase transitions of various types. The most famous of these is the Hawking—Page 
transition for a black hole in asymptotically anti-de Sitter space,®* a first order 
phase transition between thermal radiation at low temperatures and a black hole 
at higher temperatures. In the AdS/CFT correspondence of string theory, this 
transition has an interesting “dual” description as a confinement—deconfinement 
transition.°? 

We have come to understand that this kind of behavior is not at all excep- 
tional. As one example among many,!”° consider a black hole with charge Q in 
asymptotically flat space, enclosed in a cavity whose walls are maintained at a 
fixed temperature 1/3. One finds a phase diagram in the Q—@ plane containing 
a line of first-order phase transitions, terminating at a critical point which is the 
location of a second-order phase transition with calculable critical exponents. 


6.4. Thermodynamic volume 


The first law of black hole mechanics (2) closely resembles the standard form of 
the first law of thermodynamics, but it is missing a term, the pressure term —pdV. 
There is, in fact, a candidate for “pressure”: The cosmological constant A has the 
correct dimension, and acts as a pressure in cosmological settings. But A is not 
normally considered a state function. It is a coupling constant that is fixed by the 
theory, not by the state of the thermodynamic system. 

There are, however, situations in which A is a dynamical field, forced to be 


constant only by virtue of its field equations.1°' In that case, it may make sense to 
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treat it as a state function,!? and to generalize the first law to the form!%:!4 


6M =T5S + 26J + ®5Q + VinermOD, (37) 


where p = —A/87G. Note that in this formulation, the mass M should be inter- 
preted as enthalpy (that is, U + pV, where U is the internal energy) .1°4 

The “thermodynamic volume” Viherm is defined by (37), and exists even in 
the limit A — 0. Its physical meaning is not very well understood, though. For a 
Schwarzschild black hole, it is easy to check that 


4 
Viherm = Pies a (38) 


where r+ is the location of the horizon in Schwarzschild coordinates. This relation 
generalizes to higher dimensional static black holes, including those with charge: 
The thermodynamic volume is the volume of a flat round ball whose surface area 
is equal to the area of the event horizon. But the generalization to rotating black 
holes is complicated, and the corresponding thermodynamic volume does not yet 
have a simple physical interpretation.! 

The inclusion of a volume term in the first law (37) enriches the phase structure 
of the theory, with a number of interesting consequences.!°6107 For example, the 
equation of state for a charged, rotating anti-de Sitter black hole becomes very close 
to that of a van der Waals gas. Rotating anti-de Sitter black holes in more than six 
dimensions exhibit reentrant phase transitions; and anti-de Sitter black holes in six 
dimensions with multiple spins have a triple point. 


6.5. Lorentz violation and perpetual motion machines 


Lorentz invariance guarantees that there is a single “maximum speed” c, the same 
for all particles. I have implicitly used this assumption throughout this review, 
choosing units c = 1. Suppose, though, that Lorentz invariance is broken, in such 
a way that two species of particles — say A and B — have different maximum 
speeds cp > c4. Then Dubovsky and Sibiryakov have shown!°® that a black hole 
can be used to build a “perpetual motion machine of the second kind,” a device 
that transfers heat from a cold reservoir to a hot reservoir without any net use 
of energy. In some Lorentz-violating theories, there is even a classical mechanism, 
analogous to the Penrose process, for “mining” a black hole.!°9 

To understand the argument, first note that the event horizon for the more 
slowly moving species A will lie outside the event horizon for the faster species B. 
This is physically intuitive, but also follows directly if we restore the factors of c in 
the definition of the Schwarzschild event horizon, r, = 2G'M/c?. Correspondingly, 
the Hawking temperature T'4,4 of species A should be lower than the Hawking 
temperature Tp, 77 of species B. 

Dubovsky and Sibiryakov’s “construction manual” is then the following. As 
heat reservoirs, construct a shell A at temperature T'4 she that interacts only with 
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species A, and a second shell B at temperature Tp, s,¢, that interacts only with 
species B. Arrange the temperatures so that 


Tp. > Tp,she > Ta,sneu > Tan. (39) 


This will result in a net energy flux from shell A into the black hole, and from 
the black hole out to shell B. It is not hard to check that one can adjust the shell 
temperatures so that the flux into the black hole balances the flux out. The net 
result, then, is a flux of energy from the “cold” shell A to the “hot” shell B, violating 
the second law of thermodynamics. 

Arguments like this should be treated cautiously. Historically, the generalized 
second law of thermodynamics has shown remarkably resilience, often involving 
subtle and unexpected effects. There may be natural Lorentz-violating theories in 
which all matter shares a single universal horizon, for instance, or theories in which 
the black hole interior carries hidden entropy. But the apparent paradox shows, at 
least, that simple thermodynamic arguments about black holes may place surprising 
restrictions on nongravitational physics. 


7. Approaches to Black Hole Statistical Mechanics 


In ordinary thermodynamic systems, thermodynamic properties reflect the statisti- 
cal mechanics of underlying microscopic states. Entropy, in particular, is a measure 
of the number of accessible states. One might hope for the same to be true for black 
holes. Then, since the Bekenstein—Hawking entropy depends on both Newton’s con- 
stant G and Planck’s constant h, it is plausible that black hole thermodynamics is 
telling us something about quantum gravitational states. 

Black hole thermodynamics may therefore present us with a rare opportunity. 
There is very little that we actually know about quantum gravity, so any insight is 
likely to be valuable. Like many opportunities, though, this one is accompanied by 
a serious difficulty: In the absence of a quantum theory of gravity, how do we start 
to understand the quantum states of a black hole? 

Fortunately, while we do not yet have a complete quantum theory of gravity, 
we have a number of research programs, at various levels of maturity, attempting 
to develop such a theory. Some of these are advanced enough to be able to make 
predictions about at least certain classes of black holes. Indeed, as I shall discuss 
in Sec. 9, the real problem may be not that we have no microscopic explanation of 
black hole entropy, but that we have too many. 


7.1. “Phenomenology” 


From the earliest days of black hole thermodynamics, there was an obvious simple 
“phenomenological” model for Bekenstein’s area law?: One had to merely assume 
that black hole horizon area was quantized, with discrete “plaquettes” of area on the 
order of the Planck area. The simplest choice — partially justified by the observation 
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that the horizon area is an adiabatic invariant!!® — would a discrete, evenly spaced 
area spectrum. 

Suppose, for instance, that the Bekenstein-Hawking entropy were exactly the 
logarithm of the number of states. This number is, of course, an integer, so successive 
values of the area would have to differ by an amount 


AA =4hGInk, (40) 


for some integer k. As Hod first observed,!'! one could then appeal to the Bohr 
correspondence principle to determine the area spacing. While black holes have 
no stable excitations, they do have damped “ringing modes,” called quasinormal 
modes, with complex frequencies.!!? The most strongly damped quasinormal modes 
of the Schwarzschild black hole have frequencies 


In3 
~ 8nGM 
The Bohr correspondence principle would then suggest that these frequencies should 
correspond to transitions between adjacent area eigenstates, 


AA Aln3 
A= te 327G2M so 8r GM’ (42) 


which exactly matches (40) with k = 3. 

Arguments of this sort are suggestive, but must also be treated with care. For 
instance, within the same framework, Maggiore!!® has argued — based in part 
on an analogy with a damped harmonic oscillator — that relevant frequency one 
should use for the Bohr correspondence principle is not the real part Rew of the 
quasinormal frequency, but the modulus |w|. For highly damped modes 


1 1 
lel ~ ton (n+ 5): (43) 


and the Bohr correspondence principle then gives 


Rew (41) 


AA = 8nhG. (44) 


A wide range of simple models have been explored — Medved’s review! gives a 
partial list — and they do not agree about the area spectrum. The lesson seems 
to be, not surprisingly, that thought experiments and analogies can only go so far. 
One needs a genuine quantum theory of gravity to obtain a real answer. 

In the remainder of this section, I will briefly summarize some of the more 
specific approaches to black hole statistical mechanics from the point of view of 
particular models of quantum gravity. 


7.2. Entanglement entropy 


Consider a space that is divided into two disjoint pieces A and B. Suppose a quan- 
tum system on that space is characterized by a pure state |W), with a corresponding 
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density matrix pag = |V)(v|. The von Neumann entropy of such a pure state 
Sun = Trplnp (45) 


is, of course, zero. 

Now, however, imagine an observer who is restricted to the region A, and cannot 
observe anything in B. All observations of such an observer can be described by a 
reduced density matrix, p4 = Trp pap, whose entropy will in general be nonzero. 
For a local theory with a finite-dimensional Hilbert space, for instance, we can write 
the state |W) in a Schmidt decomposition 


IY) = So cilda)Ivi), (46) 


where {|@;)} and {|y;)} are orthonormal bases of states in regions A and B. Then 
pa = >- |ci?|bi) (oil, (47) 

and the von Neumann entropy of region A is simply 
Sa=Trpalnpa = |ci/* In|), (48) 


the Shannon entropy for the probabilities p; = |c;|?. 

This “entanglement entropy” is now commonplace in quantum theory, but it was 
first introduced by Sorkin in the context of black hole thermodynamics.'!° The idea 
is that an observer outside the horizon (“region A”) knows nothing about the state 
inside the horizon (“region B”), and therefore sees a mixed state with nonvanishing 
entropy. This picture arises naturally from the eternal black hole of Sec. 5.9: The 
density matrix that gives the entropy (48) is the same as the thermal density matrix 
that gives the Hawking spectrum. For many states, including the vacuum state, the 
entanglement entropy for a black hole is proportional to the horizon area,!!> 1” 
since the main contribution comes from correlations among degrees of freedom very 
close to the horizon, for which the “bulk” state is irrelevant. 

The coefficient of this area term, on the other hand, diverges, for essentially the 
same reason: The more closely degrees of freedom are localized to the horizon, 
the higher their energies. The simplest cut-off, a “brick wall” approximately one 
Planck length in proper distance from the horizon,'!® gives a coefficient for the 
Bekenstein—Hawking entropy of the right order of magnitude. In general, though, 
the exact value depends sensitively on the number and type of entangled fields — 
that is, on the particle content of the Universe. 

A possible solution to this “species problem” comes from the observation by 
Susskind and Uglum that the same modes responsible for the entanglement entropy 
also renormalize Newton’s constant, which also appears in the Bekenstein—Hawking 
entropy formula.'!® It is plausible that these contributions cancel. In particular, 
Cooperman and Luty!”° have recently argued that for a certain class of states 
defined by a path integral, the leading renormalized term in the entanglement 
entropy is given by the corresponding renormalized Bekenstein—Hawking formula, 
independent of the particle content of the theory. 
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Inspired by the AdS/CFT correspondence of string theory, Ryu and Takayanagi 
have proposed an alternative “holographic” approach to defining and regulating 
entanglement entropy.!?!:!?? In the description I have given above, the regions A 
and B lie on a (d—1)-dimensional spacelike slice of a d-dimensional spacetime. Ryu 
and Takayanagi propose embedding this spacetime as the asymptotic boundary of 
(d+1)-dimensional anti-de Sitter space. The boundary between regions A and B can 
then be extended into the “bulk” anti-de Sitter space as a d-dimensional minimal 
surface; Ryu and Takayanagi propose that the entanglement entropy should be 
proportional to the area of this surface. While this approach is obviously inspired 
by black hole thermodynamics, can be proven without any assumptions about black 
holes, at least for the static case.!2? The holographic entanglement entropy can 
then be used to determine the entropy of a black hole, and once again obtains the 


Bekenstein—Hawking expression, with no divergences or species problem.!2* 


7.3. String theory 


The leading research program in quantum gravity today is string theory. I will 
not attempt to summarize the theory here, but will instead focus on three rather 
different approaches to black hole statistical mechanics within string theory. 


7.3.1. Weakly coupled strings and branes 


The first successful calculation of black hole entropy in string theory, by Strominger 
and Vafa,!?° looked at a class of multiply charged, extremal (that is, maximally 
charged), supersymmetric (BPS) black holes in five dimensions. As classical objects, 
such black holes are completely determined by their charges: The mass, horizon 
area, and all other macroscopic quantities can be expressed as functions of the 
charges. As string theoretical objects, on the other hand, they are strongly bound 
states of the fundamental constituents of string theory, strings and D-branes. Stro- 
minger and Vafa suggested “turning down” the strength of the gravitational inter- 
action, until a black hole became a weakly coupled system of strings and D-branes. 
These constituents would still have the prescribed charges, but at weak coupling one 
could also count the number of states with such charges. The result, translated back 
into the black hole parameters, exactly matched the Bekenstein—Hawking entropy. 

These results were quickly extended to a large number of extremal and near- 
extremal black holes, and, through dualities, to some particular nonextremal black 
holes as well.!261?" A straightforward extension of the argument gives Hawk- 
ing radiation, coming from decay of excited states of the constituent strings and 
D-branes, complete with the correct greybody factors.!?8 One might worry about 
the consistency of the procedure: The process of turning down the gravitational 
coupling need not preserve the number of states. For supersymmetric black holes, 
though, the number of states is protected by nonrenormalization theorems. For 
nearly supersymmetric black holes, the procedure continues to work well, perhaps 
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even better than one might expect. Far from extremality, it becomes harder to 
obtain the right factor of 1/4 in the entropy. 

There is one peculiar feature of these calculations, which I will return to in 
Sec. 9. While the final result is an expression for entropy in terms of horizon area, 
the connection is indirect. On the strong coupling side, one may start with the 
horizon area, but one must reexpress it in terms of charges. On the weak coupling 
side, one may start with the number of states, but one must again reexpress it in 
terms of charges. It is only when the two processes are compared that one obtains 
the Bekenstein—Hawking entropy. This means that each new type of black hole 
requires a new calculation. Some crucial aspect of universality is missing. 


7.3.2. Fuzzballs 


Mathur has proposed running the analysis of the preceding section backwards.!° 


Suppose one starts on the weak coupling side, with a particular collection of strings 
and D-branes whose charges match those of a desired black hole. One may then 
turn the gravitational coupling up, and try to see what geometry appears at strong 
coupling. The result is typically not a black hole, but rather a “fuzzball,” a con- 
figuration with no horizon and no singularity, but with a geometry that looks very 
much like that of a black hole outside a would-be horizon.!?°:!! This phenomenon 
seems to require the extra dimensions available in string theory, but it provides an 
intriguing new picture of black holes. For example, Hawking radiation can appear 
as a result of a classical instability in a fuzzball geometry.!°” 

In a few special cases, one can count the number of such classical “fuzzball” 
geometries and reproduce the Bekenstein—Hawking entropy. In general, though, it 
is likely that many of the states relevant for determining the Bekenstein—Hawking 
entropy will not have classical geometric descriptions. 


7.3.3. The AdS/CFT correspondence 


A third approach to black holes in string theory exploits Maldacena’s celebrated 
AdS/CFT correspondence.'?*:134 This extremely well-supported conjecture states 
that string theory in a d-dimensional asymptotically anti-de Sitter spacetime is dual 
to a conformal field theory in a flat (d — 1)-dimensional space that can, in a sense, 
be viewed as the boundary of the AdS spacetime. For asymptotically anti-de Sitter 
black holes, this means that one can, in principle, compute the entropy by counting 
states in a (nongravitational) dual conformal field theory. 

The most straightforward application of this duality occurs for the (2+ 1)- 
dimensional BTZ black hole discussed in Sec. 5.10. Here, the dual description is 
given by a two-dimensional conformal field theory. As I shall describe in more detail 
in Sec. 9, two-dimensional conformal field theories have an exceptionally large sym- 
metry group that controls many of their properties. In particular, the density of 
states is determined by a single parameter c, the central charge, which characterizes 
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the conformal anomaly. As we saw in Sec. 5.5, anomalies can provide very useful 
tools; here, they are powerful enough to determine the density of states. 

For asymptotically anti-de Sitter gravity in 2+ 1 dimensions, the central charge 
is dominated by a classical contribution, which was discovered some time ago by 
Brown and Henneaux.®° Strominger®” and Birmingham et al.8° independently real- 
ized that this result could be used to compute the BTZ entropy, precisely reproduc- 
ing the Bekenstein—Hawking expression. As noted in Sec. 5.10, the same methods 
can be used to compute Hawking radiation for the BTZ black hole. 

While this result appears to be rather specialized, it has an important exten- 
sion. Many higher dimensional near-extremal black holes, including black holes that 
are not themselves asymptotically anti-de Sitter, have a near-horizon geometry of 
the form BTZ x trivial, where the “trivial” part merely renormalizes constants in 
the calculation of entropy. As a result, the BTZ results can be used to obtain the 
entropy of a large class of string theoretical black holes, including most of the black 
holes whose states could be counted in the weak coupling approach of Sec. 7.3.1.1%° 


7.4. Loop quantum gravity 


The second major research program in quantum gravity is loop quantum gravity, 
or “quantum geometry.” I will again focus only on those elements relevant to black 
hole thermodynamics. 

The following are the key features relevant to this question: 


(1) A basis for the kinematical states of the theory is given by spin networks, graphs 
with edges labeled by spins and vertices labeled by SU(2) intertwiners. 

(2) Given any surface }, each spin network is an eigenstate of the area operator 
As, with eigenvalue 


Ay = 817G » Vi +1), (49) 


where the sum is over the spins 7 of edges of the spin network that cross ©. Note 
that this area is quantized, but in a rather complicated way, with a spectrum 
that can be shown to have an elaborate substructure.!37188 

(3) Operators such as the area and volume depend on a parameter 7, the Barbero— 
Immirzi parameter. The significance of this parameter is quite poorly under- 
stood. Theories with different values of y are probably inequivalent, but it has 


been suggested that y may not appear in properly renormalized observables!%? 


or in a somewhat different approach to quantization.'4° 
(4) The physical states are obtained from the spin networks by imposing the Hamil- 


tonian constraint, a procedure that is not yet completely under control. 


7.4.1. Microcanonical approach 


Given this structure, a natural first step is to choose the surface © to be a black 
hole horizon and count the number of spin network states that give a prescribed 
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area.!4!,142 This is a bit tricky, since the true event horizon is defined by the global 
properties of the spacetime. On the other hand, we know that Hawking radiation 
depends only on local properties, so a local characterization should be adequate. 
The relevant definition is probably that of an “isolated horizon,” a null surface with 
vanishing expansion that obeys the laws of black hole mechanics (see the Appendix 
for details) 91136 

As we shall see in Sec. 9, the specification of such a horizon is a sort of 
boundary condition, which requires the introduction of boundary terms in the 
Einstein—Hilbert action. For loop quantum gravity, these boundary terms induce 
a three-dimensional Chern—Simons action on the horizon. The careful version of 
the state-counting'4*!44 can then be translated into a counting of Chern—Simons 
states, a well-understood procedure,!*° although with slightly subtle combinatorics. 


The ultimate result is a microcanonical black hole entropy!*6!47 
YM Ahor 
= IM Abor 50 
y 4hG A) 
where ¥ is the Barbero-Immirzi parameter and 
ym ® 0.23753 (51) 


is a constant determined as the solution of a particular combinatoric problem. If 
one chooses 7y = yy, one recovers the standard Bekenstein—Hawking entropy. 

The significance of this peculiar value yay is not understood, and it may reflect an 
inadequacy in the quantization procedure or the definition of the area operator.!4° 
Note, though, that the Barbero—Immirzi parameter appears only in the combination 
Gy, where G is the “bare” Newton’s constant, i.e. the parameter appearing in 
the action. The relevant constant in the Bekenstein—Hawking entropy, however, 
is the renormalized value,!®° 
renormalized in loop quantum gravity, the “physical” value of 7 remains uncertain. 


and since it is not known how Newton’s constant is 


Ideally, one could address this question by computing the attraction between two 
test masses in the Newtonian limit, but this seemingly straightforward problem 
is not yet solved — loop quantum gravity is defined at the Planck scale, and the 
extrapolation to physically realistic distances remains extremely difficult. 

In any case, though, once y is fixed for one type of black hole — the 
Schwarzschild solution, for instance — the loop quantum gravity computations give 
the correct entropy for a wide variety of others, including charged black holes, rotat- 
ing black holes, black holes with dilaton couplings, black holes with higher genus 
horizons, and black holes with arbitrarily distorted horizons.'4*!49 In particular, 
there is no need to restrict oneself to near-extremal black holes. 

As an interesting variation within this framework,!°? one may again look at the 
horizon area (49), but instead of counting states of the boundary Chern—Simons 
theory one may count the number of ways the boundary spins can be joined to 
a single interior vertex. This means, in essence, that one is completely course- 
graining the interior state of the black hole, much as one does in an entanglement 
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entropy computation. There are, in fact, interesting relationships to entanglement 
entropy. The end result is again an entropy proportional to the horizon area, but 
now requiring a different value of the Barbero-Immirzi parameter to match the 
Bekenstein—Hawking result. 


7.4.2. Other ensembles 


The microcanonical approach described above involves a hidden assumption that 
all spin networks that give the same horizon area occur with equal probability. This 
is a standard assumption in ordinary thermodynamics, but gravitating systems are 
not quite ordinary. Moreover, the microcanonical approach uses only properties of 
the “kinematical Hilbert space,” that is, the space of states before the Hamiltonian 
constraint has been imposed. But we know in other contexts that although Hawking 
radiation is basically kinematical, the Bekenstein—Hawking entropy depends on the 
dynamics,?” including the Hamiltonian constraint. 

In the past several years, interesting progress has been made in moving away 
from these assumptions. In particular 


(i) The quantum Hamiltonian that generates translations along the horizon has 
been identified,!°! and yields a local temperature and energy, as measured by 
an idealized particle detector, that agree with semiclassical expectations.!°? 
This is not yet an enumeration of microscopic states, but it is a version of 
the thermodynamic computation of Sec. 5.3 in a fully quantum gravitational 
setting. 

(ii) One can define a sort of grand canonical ensemble, in which the number of 
punctures — that is, of edges of the spin network that cross the horizon — 
appears with a corresponding chemical potential.‘°? The Barbero—Immirzi 
parameter still appears, but in a different form: the entropy becomes 


+ No(y), (52) 


where o(7) is a Lagrange multiplier that vanishes when y takes the value (51). 
It has been argued that if one considers additional couplings to matter, and 
makes the “holographic” assumption that the matter entropy near the horizon 
scales with the area, then the Lagrange multiplier term vanishes in the classical 
limit, giving back the standard Bekenstein-Hawking entropy.!>4 

(iii) The Barbero-Immirzi parameter can in principle take any value, but there 
is one “natural” value, y = 7. At this value, the Ashtekar—Sen connection of 
loop quantum gravity is self-dual, and the Hamiltonian constraint becomes 
much simpler. Loop quantum gravity is unfortunately much harder to work 
with when y = 7, because of the need for added constraints to ensure that 
the metric is real, but several recent computations!®® '°7 indicate that this 
choice also gives the correct Bekenstein—Hawking entropy, perhaps even more 
simply. 
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7.5. Induced gravity 


8 is a model in which the Einstein— 


Induced gravity, as first proposed by Sakharov,'® 
Hilbert action is not fundamental, but appears a consequence of the presence of 
other fields. If one starts with a theory of quantum fields propagating in an initially 
nondynamical curved spacetime, counterterms from renormalization will automat- 
ically induce a gravitational action, which almost always includes an Einstein— 
Hilbert term at lowest order.!°? In Sakharov’s language, the resulting spacetime 
dynamics is a kind of “metric elasticity” induced by quantum fluctuations of matter. 
One can write down an explicit set of “heavy” fields that can be integrated 
out in the path integral to induce the Einstein—Hilbert action. The gravitational 
counterterms normally have divergent coefficients that must be renormalized, but by 
including an appropriately chosen collection of nonminimally coupled scalar fields, 
one can obtain finite values for Newton’s constant and the cosmological constant.!®° 
Given such a model, one can then count quantum states of the “heavy” fields in 
a black hole background.'©°1! Some subtleties occur in the definition of entropy 
in the presence of nonminimally coupled fields, but in the end the computation 
reproduces the standard Bekenstein—Hawking expression, with the correct coeffi- 
cient. Furthermore, if one dimensionally reduces to a two-dimensional conformally 
invariant system near the horizon, one can count states by standard methods of 
conformal field theory,!®? as described below in Sec. 9. This offers a new, and 
apparently completely different, picture of black hole microstates as those of the 
ordinary quantum fields responsible for inducing the gravitational action. 


7.6. Other approaches 


Just as in black hole thermodynamics, there are a number of other approaches to 
black hole statistical mechanics that also show promise. For example 


e Classical black holes do not have excited states, but they have characteristic 
damped oscillations, called quasinormal modes.'!? York has argued that these 
modes should be quantized,®*? and as noted in Sec. 5.11, obtains a value close to 
the Bekenstein—Hawking entropy with an approximate quantization. 

e The “causal set” program is an attempt to quantize gravity that starts by replac- 
ing a continuous spacetime by a discrete set of points with specified causal rela- 
tions.'63 While it is not yet known how to perform an exact calculation of black 
hole entropy, there are indications that one can obtain a reasonable value by 
counting the number of points in the future domain of dependence of a cross- 

164 or the number of causal links crossing the horizon.!® 

e Zurek and Thorne have shown that there is sense in which the entropy of a black 
hole counts the number of distinct ways in which a black hole with prescribed 


macroscopic properties such as mass and angular momentum can be formed from 
166 


section of the horizon 


the collapse of quantum matter. 
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7.7. Logarithmic corrections 


While many approaches to quantum gravity give the correct Bekenstein-Hawking 
entropy, they may differ on the next order corrections. In general, one expects 
An Ay 
Seu = Tye +t aln ( iG aca (53) 


where the coefficient @ may depend on the quantum theory. There are some argu- 


ments for a “universal” answer that, as in Sec. 9, depends on conformal field the- 
ory,'®? but these are not conclusive, and particular models seem to give varying 


168-171 The problem is further complicated by the fact that the logarithmic 
172 


results. 
terms in (53) may differ depending on one’s choice of thermodynamic ensemble. 


8. The Holographic Conjecture 


In ordinary thermodynamic systems, entropy is an extensive quantity, scaling with 
volume. For black holes, in contrast, the entropy scales with area. It is not so 
surprising that entropy is not quite extensive for a gravitating system: Gravity 
cannot be shielded, so when one increases the size of a system one is necessarily 
changing the internal interactions. Nor is it surprising that entropy cannot be simply 
defined in terms of a density that can be integrated over a volume: There is a 
general result that, as a consequence of diffeomorphism invariance, observables in 
general relativity cannot be defined in terms of local densities.!”° Still, an entropy 
proportional to area is a rather dramatic departure. 

’t Hooft!”* and Susskind!”® have proposed that this feature is not unique to 
black holes, but is a general property of any gravitating system. They suggest that 
the entropy inside any region, whether or not it is a black hole, should scale as the 
area rather than the volume. This conjectured “holographic principle” would imply 
that our usual view of local physics drastically overcounts degrees of freedom. 


t,!76 consider 


As a first argument for the plausibility of this holographic viewpoin 
an approximately spherical surface S with a surface area As = 47R?. Let us try 
to fill the interior of this surface with quantum excitations. To be contained in the 
region, each excitation should have a wavelength no larger than 2R, and thus an 
energy FE = h/2R. N such excitations will have an energy E = hN/2R, and to 
avoid forming a black hole, we must have R = 2G'E. Hence 

ree 

~ hG 
If we interpret N as the number of degrees of freedom in the system, we see that 
(54) implies a holographic bound. 

As asecond argument, suppose the surface S now contains almost enough matter 
to form a black hole. Now surround the region by a collapsing shell of matter, with 
enough additional mass to form a black hole. The initial state has an entropy 


(54) 


Sint = Di seseiae Si Sshetts 
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while the final state has an entropy 


As 

Te 

By the generalized second law of thermodynamics, Syn. > Sinu. Hence, the 
matter initially inside S must have had no more entropy than that of a black 
hole. 

Neither of these arguments is conclusive. The first implicitly assumed that the 
region inside S was flat. One may be able to fit more degrees of freedom into a 
curved space. There are, in fact, classical configurations — so-called “monsters” — 
in which the entropy within a region is greater than that of the corresponding black 
hole,!”” although all known examples form from initial singularities (they cannot 
be “made in the laboratory”) and collapse quickly to form black holes. 

The second argument fails for a more subtle reason: the area of S is not a gauge 
invariant quantity.!7*179 We are really imagining a timelike “world tube” traced 
out by S and asking its cross-sectional area at a fixed time. But this area depends 
on the choice of time-slicing, and by choosing a slice that “wiggles” enough in a 
timelike direction — and is therefore nearly null over much of its intersection with 
the world tube — we can make the area as small as we like. 

One might worry that the same problem occurs for black holes, and that the 
“horizon area” might also not be well-defined. In fact, black holes evade the prob- 
lem, essentially because the horizon is not timelike, but null (and, for stationary 
black holes, nonexpanding). Bousso!®? has proposed a covariant entropy bound 
that exploits a similar idea. He starts with a closed surface S, no longer necessar- 
ily spherical, and extends it not as a timelike world tube, but as a “collapsing” 
light sheet, a null hypersurface of decreasing area extending to the past and future 
of S, as illustrated in Fig. 5. Bousso’s conjecture is that the total entropy flux 
through either light sheet — At or A~ in the figure — is bounded by Ag/4hG. 
For the special case of a black hole, A~ may be interpreted as the event hori- 
zon, and the Bousso entropy bound is basically the generalized second law of 
thermodynamics. 

Classically, Bousso’s conjecture can be proven correct, given suitable energy con- 


Seinal = 


ditions and bounds on the entropy flux and its gradient.!7%:!8! Quantum mechani- 
cally, the situation is less clear, though there is strong evidence for the case of free 
fields with negligible gravitational backreaction.'®? 

Although the holographic principle in its larger sense remains a conjecture, 
there is one case in which it has been dramatically successful. The AdS/CFT corre- 
spondence of string theory!**:154 describes the “bulk” physics of an asymptotically 
anti-de Sitter space in terms of a lower-dimensional “surface” conformal field the- 
ory. If this holographic viewpoint can be extended beyond the anti-de Sitter case, 
black hole thermodynamics will have had a fundamental impact on a far wider field, 
profoundly altering our approach to fundamental physics. 


II-448 S. Carlip 


At 


Fig. 5. Future-directed null geodesics orthogonal to S form two light sheets, usually one expand- 
ing outward (not shown) and one collapsing inward (A+). The same holds for past-directed null 
geodesics. In this figure, symmetry causes the light sheets to collapse to single points. In gen- 
eral, they will end at caustics where neighboring geodesics cross and the expansion becomes 
positive. 


9. The Problem of Universality 


For a physicist working on quantum gravity, black hole thermodynamics is, at first 
sight, a huge source of hope. The Hawking temperature and Bekenstein—Hawking 
entropy are “quantum gravitational,” depending on both Planck’s and Newton’s 
constants. Moreover, as Wheeler’s famous aphorism states, “a black hole has no 
hair” — a classical black hole has no distinguishing characteristics beyond its mass, 
charges, and spins. Hence, if the entropy of a black hole is an ordinary statistical 
mechanical entropy, it seems that the only states it can be counting are quantum 
gravitational states. We may thus have a window into quantum gravity. 

Upon closer examination, though, the situation is not so simple. As we saw in 
Sec. 7, we suffer an embarrassment of riches: There seem to be many different quan- 
tum descriptions of black hole statistical mechanics, with very different quantum 
states, all giving the same entropy. Moreover, while the simple Bekenstein area law 
seems natural in some of these approaches, in others it seems almost miraculous. 

The “problem of universality” is really two problems, which are logically distinct, 
although their solutions may be related. The first is that black hole entropy has such 
a simple, universal value, one-fourth of the horizon area in Planck units, regardless 
of the mass, spin, charges, or the number of dimensions. Even the horizon topology 
is irrelevant: The same area law holds for black holes, black strings, black rings, and 
black branes. The second is that so many different approaches to quantum gravity 
give the same result, regardless of any of the details of the black hole states. 
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It is tempting to address the first problem with a claim that the horizon area 
must be quantized, as in Sec. 7.1. But such an answer is at best incomplete: It 
does not explain the universal coefficient of 1/4, nor does it tell us why no other 
features matter. There is an interesting explanation of the factor of 1/4 in terms 
of the topology of the Euclidean black hole, but it is not clear why this feature 
would be reflected in the counting of states. 

A cynical answer to the second problem is that only quantum theories that 
give the “right” answer are published. But this, too, is at best incomplete: It does 
not explain why any such models work at all. Consider, for example, the weakly 
coupled string approach of Sec. 7.3.1. As we saw there, the theory gives the correct 
entropy for a large class of near-extremal black holes. But each new type of black 
hole requires a separate computation, and the relationship between entropy and 
area only appears at the very last step. Such miracles cry out for explanation. 

In the remainder of this section, I will describe one attempt to explain these 
miracles. This is still very much a case of work in progress, but there have been 
some hopeful signs. 


9.1. State-counting in conformal field theory 


Statistical mechanical entropy is basically a measure of the number of quantum 
states, so we are looking for a universal mechanism to explain the density of states 
of a black hole. There is one context in which such a mechanism is known: that of 
two-dimensional conformal field theory. 

The metric for a two-dimensional manifold can always be written locally as 


ds? = 2g,zdzdz, (55) 


in terms of complex coordinates z and Z. Holomorphic and antiholomorphic coordi- 


nate changes z > z+&(z), 7 ~ 7+ €(Z) merely rescale the metric, and provide the 
basic symmetries of a conformal field theory. Unlike the conformal symmetry group 
in higher dimensions, the two-dimensional group has infinitely many generators, 
denoted L[€] and L[€]. These satisfy a Virasoro algebra‘? 


(LE), Lil] = Line! - én!) + ef datas" — e'", 
@ 


[E(2], Lal] = Line’ - én’) + — | az(a’e” - 2&7’), cao) 


[L{é), L[nl] = 9, 
uniquely determined by the values of the two constants c and ¢, the central charges. 
As in ordinary field theory, the zero modes Lo and Lo of the symmetry generators 
are conserved quantities, the “conformal weights,” which can be seen as linear 
combinations of mass and angular momentum. 
Two-dimensional conformal symmetry is remarkably powerful. In particular, 
with a few mild restrictions, Cardy has shown that the asymptotic density of states 
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of any two-dimensional conformal field theory is given by!®?:!*4 


[cL 7 [eL 
In p(Lo) ~ 2a =, In p(Lo) ~ 2m = (microcanonical) 


1? 1? 
In p(T) ~ ach Inp(T) ~ gf (canonical) 


(57) 


where 7 is the temperature. The entropy is thus uniquely determined by a few 
parameters, regardless of the details of the conformal field theory — just the kind 
of universal behavior we would like for black holes. 


9.2. Application to black holes 


Black holes are neither conformally invariant nor two-dimensional, so one might 
wonder whether the preceding results are relevant. There is a sense, however, in 
which the near-horizon region is nearly conformally invariant: The extreme redshift 
washes out dimensionful quantities such as masses for an observer who remains 
outside the black hole.!®> The same redshift also makes transverse excitations neg- 
ligible relative to those in the r — t plane, effectively reducing the problem to 
two dimensions. This is the same dimensional reduction that allowed many of the 
anomaly-based calculations described in Sec. 5.5. 

Conformal techniques of this type were first applied to the (2 + 1)-dimensional 
BTZ black hole.8”:88 Here, the connection to two-dimensional conformal field the- 
ory is clear: The asymptotic boundary at infinity is a two-dimensional cylinder, and 
the canonical generators of diffeomorphisms obey a Virasoro algebra. Moreover, as 
Brown and Henneaux showed long ago,®*® this Virasoro algebra has a classical cen- 
tral charge, whose presence can be traced back to the need to add boundary terms 
to the canonical generators. The conformal weights Lo and Lo for the BTZ black 
hole have simple expressions in terms of charge and mass, and the microcanoni- 
cal form of the Cardy formula (57) then gives the standard Bekenstein-Hawking 
entropy. 

The generalization to higher dimensional black holes 
not so firmly established, but there has been some significant progress.4 The key 
trick is to treat the horizon as a sort of boundary — not in the sense that matter 
cannot pass through it, but in the sense that it is a place where one must impose 
“boundary conditions,” namely the condition that it is a horizon. As in the BTZ 
case, the canonical generators of diffeomorphisms acquire boundary terms, and for 
a number of sensible “stretched horizon” boundary conditions these pick out a 
Virasoro subalgebra with a calculable central charge. Similar techniques have been 
used for the near-horizon geometry of the near-extremal Kerr black hole,!®° 
giving the expected Bekenstein—Hawking entropy. 


186-188 is more subtle, and 


again 


dSee Ref. 189 for a more thorough treatment. 
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9.3. Effective descriptions 


While conformal methods may allow a “universal” computation of Bekenstein— 
Hawking entropy, they do not tell us a lot about the quantum states of the black 
hole. In one sense, this is a good thing: The point, after all, is to explain why many 
different descriptions of the states yield the same entropy. Still, one may be able to 
learn something useful about “effective” descriptions. 

It is well known that the presence of a boundary, either at infinity or at a 
finite location, alters the canonical generators of diffeomorphisms by requiring the 
addition of boundary terms.®° The exact form of these boundary terms depend on 
the choice of boundary conditions, and the resulting generators necessarily respect 
these boundary conditions. As a result, some of the “gauge transformations” of the 
theory without boundary — those that change the boundary conditions — are no 
longer invariances of the theory. This boundary symmetry-breaking increases the 
number of states in the theory: States that would previously have been considered 
t.19! Moreover, these new states appear only at the 
boundary, since the gauge symmetry remains unbroken elsewhere. 

This phenomenon is not quite the same as the Goldstone mechanism,!"? but it 
is similar in spirit. By analogy, it suggests a possible effective description of black 
hole horizon states in terms of the parameters that label the boundary-condition- 
breaking “would-be diffeomorphisms.” For the case of the BTZ black hole, such a 
description can be made explicitly.89:!%? The effective field theory is a particular 
conformal field theory, Liouville theory, with a central charge that matches the 
Brown—Henneaux value. The counting of states in this theory is poorly understood: 
Technically, the relevant states are in the “nonnormalizable sector,” which is not 
under good control. But as discussed in Sec. 5.10, one can couple the Liouville 
theory to external matter and recover a correct description of Hawking radiation. 

The effective description described here is holographic, and is reminiscent of the 
“membrane paradigm” for black holes,!94:1% 
by a collection of degrees of freedom at the horizon. It is possible that a better 
understanding of these degrees of freedom could help us understand the information 
loss problem, the next topic of this review. 


gauge-equivalent are now distinc 


in which a black hole is also described 


10. The Information Loss Problem 


Let us turn finally to one of the most puzzling aspects of black hole thermodynam- 
ics,°° the “information loss paradox.” This is a topic that is very much in flux, 
with nothing near a consensus concerning its resolution. The purpose of this section 
is therefore simply to introduce some of the central ideas. 

Consider a process in which a shell of quantum matter in a pure state is allowed 
to collapse to form a black hole, which then evaporates completely into Hawking 
radiation. It appears that this process has allowed an initial pure state to evolve into 
a final highly mixed (thermal) state. But this is not possible in quantum mechanics: 
Unitarity requires that a pure state evolve to a pure state. If one thinks of the von 
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Neumann entropy as a measure of lack of information, this process has led to an 
unallowable loss of information. 

This argument contains several hidden assumptions, of course, any one of which 
could point to a resolution. We have assumed that a pure state can collapse to 
form a black hole; that the final state consists only of Hawking radiation, with no 
long-lived “remnants” of the black hole; that the final spacetime is simply nearly 
flat space, with no residue of the singularity and no “baby universes” that could 
swallow up lost correlations; and that the Hawking radiation is genuinely thermal, 
with no hidden correlations. Let us consider each of these in turn. 


10.1. Nonunitary evolution 


One simple answer to the information loss problem is simply that quantum mechan- 
ics is not quite correct, and the evolution is not unitary. Figure 6 shows a proposed 
Carter—Penrose diagram for an evaporating black hole. In this figure, it is clear 
where the “missing” information has gone: It has simply vanished into the sin- 
gularity. Of course, one might hope that quantum gravity would eliminate the 
singularity, but that need not guarantee that the lost correlations would have to 
reappear for a future observer. 

Restricted to black holes, such a loss of unitarity might seem innocuous. But 
there are presumably quantum amplitudes for any physical process that include 
virtual black holes, so the effect, though perhaps tiny, would be pervasive. It has 
been argued that such violations of unitarity lead to either violations of causal- 
ity or energy nonconservation at an unacceptable level,!®” but there are known 
counterexamples,!®* and the possibility remains open. 


10.2. No black holes 


We really understand the formation of a black hole only as a classical process, in 
which the collapsing matter is typically in a complicated thermal state. It is possible 


Fig. 6. A possible Carter—Penrose diagram for an evaporating black hole. 
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that matter in a pure state simply does not collapse to form a black hole. In the 
fuzzball picture of Sec. 7.3.2, for instance, the microscopic states are not black holes: 
Properties such as horizons appear only statistically as features of ensembles.!?! 

One can also write down “phenomenological” metrics for which the black hole 
singularity is replaced by a nonsingular de Sitter-like region.19? 2°? In such models, 
a true horizon never forms, but there can be a “quasi-horizon” that looks much like 
an event horizon for a long period of time. Recall from Sec. 5.1 that a horizon is not 
required for Hawking radiation — it is enough to have exponential blue shift and a 
slowly varying effective surface gravity — so for the most part, the thermodynamics 
will not change. But there is no longer a singularity at which correlations can be lost, 
and when the quasi-horizon eventually disappears, the information in the interior 
becomes accessible. 

Such models share some similarities with the “remnant” scenarios discussed 
below. In particular, there are unsettled issues concerning how fast the information 
can “escape” after the disappearance of a quasi-horizon. Nevertheless, they capture 
a feature that must be shared by almost any model of unitary evolution: If the 
black hole eventually disappears without any loss of information, then almost by 
definition there cannot have been a true event horizon. 


10.3. Remnants and baby universes 


Calculations of Hawking radiation seem likely to be reliable down to the length 
scales at which quantum gravitational effects become important. Beyond that scale, 
though, we do not know what to expect. In particular, it is possible that black hole 
evaporation might stop,2? leaving a black hole “remnant.” 

If remnants exist, they offer a new resolution to the information loss problem. 
A remnant could be correlated with the Hawking radiation, allowing the combined 
state to remain pure. Of course, this requires that remnants have a huge entropy, 
that is, a huge number of possible states. This is generally understood to be incom- 
patible with the AdS/CFT correspondence, and there is a danger of remnants being 
overproduced by pair production in the vacuum.?°4 On the other hand, particu- 
lar exact models of black hole evaporation do avoid the information loss problem 
through a remnant mechanism,?”> providing at least an existence proof. 

Remnants need not have infinite lifetimes. They, too, may eventually decay 
into something like Hawking radiation. There are, however, constraints on such 
a process, if unitarity is to be preserved.?°7:2°6 As an estimate of the lifetime,20” 
suppose the remnant has mass mp, and decays by emitting N photons. Each photon 
carries roughly one bit of information, and if the process is to preserve unitarity, 
their total entropy must be on the order of that of the original black hole. We thus 
require N ~ GM7?/h, so each photon should have an energy and wavelength 


hmpR GM? 
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For the photons to be uncorrelated, they should be emitted far enough apart that 
their wave packets do not significantly overlap. The total time for emission is thus 


4 
eee (=) Se (58) 
mpi) mR 
typically far longer than the age of the universe. By carefully analyzing the entan- 
glement entropy at future null infinity, Bianchi has found a similar time scale, 
although with a bit more flexibility depending on the details of the evaporation.2°° 
A related approach to the problem is the “baby universe” scenario.?!° While we 
might expect quantum gravity to eliminate the singularity in Fig. 6, the result could 
plausibly be a new region of the Universe that is causally disconnected from Zt — 
something one could perhaps think of as a peculiar, large remnant. In this case, 
unitarity is technically maintained, but it cannot be tested by a single observer. 


10.4. Hawking radiation as a pure state 


The most widely held expectation among physicists in this field is that the infor- 
mation loss problem will be resolved by a demonstration that Hawking radiation 
has subtle hidden correlations and is actually in a pure state. This belief gains sup- 
port from the AdS/CFT correspondence: In the setting in which we think we best 
understand quantum gravity, the boundary conformal field theory is unitary, and 
there does not seem to be room in its Hilbert space for remnants. At first sight, 
this idea does not seem unreasonable. The density matrix of a pure state can be 
very close to a thermal density matrix,?? differing only by terms of order e~*. 
But while the difference between a pure state and a thermal state may be very 
small, it can still be large enough to cause trouble. Consider two Hawking quanta, 
one “early” and one “late,” emitted by a black hole, as shown in Fig. 7. As in 
Sec. 4.2, each escaping particle is accompanied by a “partner” inside the horizon. 
In order for a freely falling observer to see the standard vacuum near H, particle B 
must initially be entangled with its “partner” A. This correlation is part of the basic 
structure of the quantum vacuum. It can be violated only at the cost of introducing 
large vacuum expectation values of such quantities as the stress—energy tensor. 


H. exterior 


~ 
m4 


Fig. 7. Early and late Hawking quanta emitted from near the horizon H. 
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On the other hand, if the Hawking radiation is to be in a pure state in the 
far future, particle A must be entangled with other particles in the radiation — 
for simplicity, say particle C. But this violates a fundamental result of quantum 
mechanics, “monogamy of entanglement,” which states that a state cannot be simul- 
taneously maximally entangled with two others.?!! More technically, the problem 
can be restated as a violation of the strong subadditivity of entropy.° 

Of course, two quantum states that are initially entangled need not remain so, 
and it might be possible to transfer the entanglement of particle B from A to C. 
But A and C are not causally connected — they are separated by a horizon — 
so such a process would have to be nonlocal.?!* One might respond that quantum 
gravity is always nonlocal,?!*:!4 but it remains unclear whether “enough” nonlocal- 
ity can be made compatible with local effective field theory without doing violence 
to our notions of the scales at which quantum gravitational effects ought to be 
important. 

For a slightly different perspective,?!° consider the “membrane” description of 
effective degrees of freedom of Sec. 9.3, or more generally a holographic picture in 
which the black hole is described by horizon degrees of freedom. As particle A enters 
the black hole, its state should be captured by these horizon degrees of freedom, 
which could then be correlated with the later emission of particle C. An infalling 
observer, on the other hand, should be able to see particle A inside the horizon. 
This would seem to violate the “no cloning” theorem of quantum theory,?!® which 
prohibits the duplication of a state. The notion of “black hole complementarity” is 
that this need not be a problem, because no single observer can measure both states. 
But in a much-cited paper, Almheiri et al. have disputed this,° arguing instead that 
the entanglement across the horizon is likely to be broken; see also Braunstein?!” 
for a similar prediction. The resulting large expectation values of the stress—energy 
tensor would then form a “firewall” for any infalling observer. As Almheiri et al. 
point out, this implies a breakdown of the principle of equivalence near a horizon, 
a rather dramatic claim. At this writing, the controversy is far from being settled. 


11. Conclusion 


Classically, black holes are very nearly the simplest structures in general relativity. 
With the advent of black hole thermodynamics, however, we have come to see them 
as highly complex thermal systems, rarely at equilibrium, with a truly remarkable 
number of internal states. 

Perhaps most surprisingly, we have learned that ordinary methods of general 
relativity and quantum theory — quantum field theory in curved spaces, WKB 
approximations, semiclassical path integrals, and the like — allow us to probe prop- 
erties of quantum gravitational states. Such a claim should be greeted with skepti- 
cism, of course. But over the past few decades we have gathered such a weight of 
self-consistent results, based on enough different and independent approximations, 
that it seems almost certain we are seeing something real. 
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At the same time, black hole thermodynamics is having a profound impact on the 
rest of physics. The holographic conjecture suggests that our fundamental notions of 
local physics, and perhaps of space and time, are only low energy approximations. 
Attempts to understand black hole universality, while less sweeping, also point 
to the key role of “boundary” states. The information loss problem has led us 
to question basic aspects of quantum mechanics, and has exhibited remarkable 
connections among very different aspects of physics. And perhaps in the future, 
black hole thermodynamics will tell us something profound about quantum gravity. 
We have interesting times ahead. 
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Appendix A. Classical Black Holes 


The Schwarzschild and Kerr solutions, discussed elsewhere in this book, are the 
archetypes of black holes. For black hole thermodynamics, though, we need more 
general configurations: “dirty” black holes whose horizons are distorted by nearby 
matter, dynamical black holes formed from collapsing matter, black holes in other 
spacetime dimensions. For this, we need a generalization of the notion of a horizon. 

A black hole is defined as a “region of no return,” an area of spacetime from 
which light can never escape. This idea can be made more precise — see Hawking 
and Ellis?!® for details — by considering a spacetime whose Carter-Penrose diagram 
looks roughly like that of Fig. 1, in the sense that it contains a future null infinity 
T* that describes the “end points at infinity” of light rays. Points in the causal 
past of Zt are ordinary events from which light can escape to infinity. Points that 
do not lie in the past of Z* are cut off from infinity, and form a black hole region. 
The dividing line — the boundary of the past of Z* — is the event horizon; in 
Fig. l(a), it is the dashed line labeled H. 

The event horizon defined in this way has many interesting global properties. 
For instance, it cannot bifurcate or decrease in area.?!® But it might be argued that 
this definition does not quite capture the right physics. The problem can be traced 
to the word “never” in the phrase “light can never escape” — to truly make such a 
statement, one must know about the entire future. The event horizon is teleological 
in nature: it is determined by “final causes,” events in the indefinite future. 

Imagine, for instance, that the Earth is sitting at the center of a highly energetic 
incoming spherical shell of light, one so energetic that it has a Schwarzschild radius 
of one light year. To be sure, this is not likely, but we also cannot rule out the 
possibility observationally: no signal can propagate inward faster than such a shell, 
so we would not know of its existence until it reached us. Suppose now that the 
shell is presently two light years away, collapsing inward toward us at the speed of 
light. Step outside and point a flashlight up into the sky. One year from now, the 
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light from your flashlight will have traveled one light year, where it will meet the 
collapsing shell just as the shell reaches its Schwarzschild radius. At that point, the 
light will be trapped at the horizon of the newly formed Schwarzschild black hole, 
and will be unable to travel any farther outward. In other words, in this scenario 
we are already at the event horizon of a black hole, even though we will see no 
evidence for this fact until we are suddenly vaporized two years from now. 

This seems odd; surely Hawking radiation “now” should not need to know about 
the infinite future. A number of attempts have been made to find more local versions 
of the event horizon,?!® leading to a variety of more or less useful definitions. The 
most useful for our purposes is the “isolated horizon,”® a locally defined surface 
with properties suitable for describing equilibrium black holes. 

An isolated horizon is a null surface — a surface traced out by light rays — 
whose area remains constant in time, as the horizon of a stationary black hole does. 
A thought experiment may again be helpful. Picture a spherical lattice studded 
with flashbulbs, set to all flash simultaneously in the lattice rest frame. The bulbs 
will produce two spherical shells of light, one traveling inward and one traveling 
outward. As we know from our experience in nearly flat spacetime, the area of 
the ingoing shell will normally decrease with time, while the area of the outgoing 
shell will increase. If the lattice is placed at the horizon of a Schwarzschild black 
hole, though, it is not hard to show that the area of the outgoing shell will remain 
constant. Similarly, if the lattice is placed inside the horizon, both shells will decrease 
in area.® 

To express this idea mathematically, let us first define a nonexpanding horizon 1 
in a d-dimensional spacetime to be a (d—1)-dimensional submanifold such that? °° 


(1) # is null: that is, its normal vector @, is a null vector; 

(2) the expansion of H vanishes: Jig) = g?’Valy) = 0, where qap is the induced 
metric on H; 

(3) the field equations hold on H, and —T%,¢° is future-directed and causal. 


The first condition expresses the fact that the horizon is traced out by light rays. 

The expansion Vg) is the logarithmic derivative of the area of a cross-section of 

H,?*° so the second condition implies time independence of the horizon area. The 

third condition is a relatively weak prohibition of negative energy at the horizon. 
These three conditions imply that 


Vil =wal? onH 


for some one-form w,. The surface gravity K¢) for the normal ¢* — the quantity 
that appears in the Hawking temperature (1) — is then defined as 


Ke) = LW. (A.1) 
©The outgoing shell must still be outgoing with respect to the lattice, of course; as the lattice itself 


collapses inward, its area will decrease even faster than that of the “outgoing” shell of light. This 
is one way of understanding why any object inside a black hole must necessarily collapse inward. 
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The normal £% is not quite unique, though: if @* is a null normal to H and X is 
an arbitrary function, then e* is also a null normal to H. We can reduce this 
ambiguity by demanding further time independence: a weakly isolated horizon is 
one for which, as an added condition, we require that 


(4) Loew =OonH, 


where CL is the Lie derivative. This constraint implies the zeroth law of black hole 
mechanics, that the surface gravity is constant on the horizon. 

Even with this fourth condition, the null normal @* may be rescaled by an 
arbitrary constant. This also rescales the surface gravity, so the numerical value of 
Kg) is not uniquely determined. At first sight this seems to be a terrible feature, 
but it reflects a genuine physical ambiguity: even at a horizon, the choice of time 
coordinate is not completely fixed. In fact, the first law of black hole mechanics (2) 
requires such an ambiguity: the scale of the mass M is also fixed only after one has 
normalized the scale of time at infinity. 

To clarify this issue, it will help to take a small detour. For a stationary black 
hole, another type of a horizon can be defined. A Killing horizon is a (d — 1)- 
dimensional submanifold Hx such that 


(1) some Killing vector .% is null, that is, y¥a.*% = 0; and 
(2) Hx is itself a null surface, that is, its normal vector is null. 


If both conditions hold, it follows that x is itself a null normal to H,«, and that 
x°Vax’ = Hise” on Hx. (A.2) 


Note also that the expansion g7°V, yp is automatically zero on Hx, simply by virtue 
of the Killing equation Vaxo + Voxa = 0. 

In this case, the null vector @* that defines an isolated horizon may be chosen 
to equal vy" at 7x, and the two horizons coincide. This does not quite solve the 
problem of normalization: a Killing vector also requires normalization, since if y* 
is a Killing vector, so is cy". But in the stationary case, the normalization of y* 
can be fixed at infinity, for example by requiring that x* ~ 1. In other words, 
for stationary black holes one can use global properties of the spacetime to adjust 
clocks at the horizon by comparing them with clocks at infinity. 

If, on the other hand, one is only concerned with physics at or near the horizon, 
the normalization of «(~) becomes more problematic. One can use known properties 
of exact solutions to express the surface gravity in terms of other quantities at 
the horizon,??! thereby fixing £7, but so far the procedure seems rather artificial. 
Alternatively, one can simply accept the ambiguity, and note that other quantities 
such as quasilocal masses defined near 1 require a similar choice of normalization. 

As noted in Sec. 2, weakly isolated horizons obey the four laws of black hole 
mechanics, the second law in the equilibrium form that the horizon area remains 
constant. Generalizations to dynamical, evolving horizons are also possible,?2! and 
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may be used to prove the inequality version (3) of the second law. These generaliza- 
tions also provide a potential setting for nonequilibrium black hole thermodynamics, 
allowing us, for instance, to describe flows of energy and angular momentum that 
include the contribution of the gravitational field. 
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This paper presents an “in-a-nutshell” yet self-contained introductory review on loop 
quantum gravity (LQG) — a background-independent, nonperturbative approach to a 
consistent quantum theory of gravity. Instead of rigorous and systematic derivations, it 
aims to provide a general picture of LQG, placing emphasis on the fundamental ideas and 
their significance. The canonical formulation of LQG, as the central topic of the paper, 
is presented in a logically orderly fashion with moderate details, while the spin foam 
theory, black hole thermodynamics, and loop quantum cosmology are covered briefly. 
Current directions and open issues are also summarized. 
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1. Introduction 


Quantum gravity (QG) is the research in theoretical physics that seeks a consis- 
tent quantum theory of gravity. It is considered by many as the open problem 
of paramount importance in fundamental physics, as its task is to unify quantum 
mechanics (more specifically, quantum field theory, QFT) and general relativity 
(GR), which are the two greatest theories discovered in the twentieth century and 
have become the cornerstones of modern physics. 

Ever since it was recognized that the gravitational field needs to be quantized, 
the quest for a satisfactory quantum description of spacetime has never stopped, 
and gradually QG has grown into a vast area of research along many different 
paths with various doctrines. The two most developed approaches are string theory 
and loop quantum gravity (LQG). Other directions include causal sets, dynamical 
triangulation, emergent gravity, H space theory, noncommutative geometry, super- 
gravity, thermogravity, twistor theory, and many more. (See Refs. 1-4 for surveys 
and more references on different theories of QG.) 

Among other approaches, the strength of LQG is that it provides a compelling 
description of quantum spacetime in a nonperturbative, background-independent 
fashion. The beauty lies in its faithful attempt to establish a conceptual framework 
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that consolidates the apparently conflicting tenets of QFT and GR. LQG deliber- 
ately adopts the “minimalist” approach in the sense that it focuses solely on the 
search for a consistent quantum theory of gravity without requiring any extraor- 
dinary ingredients such as extra dimensions, supersymmetry, and so on (although 
many of these can be incorporated compatibly). Unlike string theory, the aim of 
which is much more ambitious, LQG does not intend to address the unification 
problem of finding a “theory of everything” in which all forces, including gravity, 
are unified. The central problem confronted by LQG simply is: What is the consis- 
tent quantum field theory of which the low-energy limit is classical GR, if it exists 
after all? 

Having been developed about 25 years (see Ref. 5 for its history and current 
status), LQG has faced many challenges and achieved encouraging progress. Many 
long-standing open problems within the approach have been solved, and LQG now 
provides a rigorous mathematical foundation for QG. One of the main results is 
the discovery that the spectra of area and volume are discrete and the correspond- 
ing quantum states, called spin networks, reveal the microscopic structure of space 
that is granular in Planck scale. The discreteness of space is not postulated ad hoc 
but a direct consequence of quantization in the same nature as the discrete levels 
of energy in an atom. Because of the discreteness, LQG enables one to derive the 
Bekenstein—Hawking entropy of black holes from first principles, offering a micro- 
scopic explanation of its proportionality to the area of the horizon. Furthermore, the 
construction of LQG relies heavily on diffeomorphism invariance. Once the gauge 
degrees of diffeomorphism are factored away, the resulting quantum states, called 
s-knots, are not to be interpreted as quantum excitations on a space but, rather, 
of the space itself, because any external reference to localization of quantum states 
becomes irrelevant and only their contiguous relation remains significant. In terms 
of this relational structure, LQG profoundly manifests background independence — 
the key principle of classical GR — in the context of QFT. With many successes 
achieved, the major weakness of LQG is in our limited understanding about its 
quantum dynamics and low-energy physics, but these two aspects have been inten- 
sively investigated and some promising solutions have emerged. 

LQG is closely related to the spin foam theory, an alternative approach of QG. 
While LQG is based on the canonical (Hamiltonian) formalism, the spin foam theory 
can be viewed as the covariant (sun-over-histories) approach of LQG. The history 
of a spin network (more precisely, an s-knot) evolving over time is called a spin 
foam, which represents a quantized spacetime in the same sense that a spin network 
represents a quantized space. The transition amplitude is given by a discrete version 
of path integral, which is the sum, with proper weight factors, over all possible spin 
foams sending the initial spin network to the final one. Extensive research on the 
spin foam theory has made rapid advances, suggesting that a specific class of spin 
foam models might provide a covariant definition of the LQG dynamics. 

Applying principles of LQG to cosmological settings, one obtains loop quantum 
cosmology (LQC). LQC is a symmetry-reduced model of LQG with only a finite 
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number of degrees of freedom. The simplification of symmetry reduction makes 
possible a detailed investigation on the ramifications of loop quantum effects, many 
of which remain obscure in the full theory of LQG. One of the striking results of LQC 
is that the big bang singularity is resolved by the loop quantum effects and replaced 
by a quantum bounce, which bridges the present universe with a preexistent one. 
The bouncing scenario opens up a new paradigm of cosmology. Research of LQC 
has led to many successes and represents a very active research field in recent years. 

There are many books and review articles available on LQG,® !° the spin foam 
theory,!4!” and LQC.!® ?! This paper serves as an “in-a-nutshell” yet self-contained 
introductory review on LQG. It is centered on the canonical formulation of LQG 
with moderate details but covers the spin foam theory, black hole thermodynamics, 
and LQC only very briefly. 

The paper is organized as follows. After a quick review on the problems of QG 
in Sec. 2, the main topic of canonical approach of LQG is presented in Secs. 3-8 in 
a logically orderly fashion. Additionally, the spin foam theory, black hole thermo- 
dynamics, and LQC are briefly described in Secs. 9-11, respectively. Finally, Sec. 12 
outlines the current directions and open issues. 

A large part of this paper is heavily based on Refs. 8 and 9 without delving into 
all technicalities. Conventions are given as follows. The signature of the spacetime 
metric gag is given by (—,+,+,+). The Greek letters in lower case are used for the 
coordinates of the four-dimensional spacetime manifold, while the Latin letters in 
lower case starting a,b,... are for the coordinates of the three-dimensional space 
manifold. The Latin letters in upper case starting J, J, K,... = 0,1, 2,3 are used for 
the four-dimensional “internal space” (the vector space on which the Lorentz group 
SO(3, 1) is represented), while the Latin letters in lower case starting i,j,k,... = 
1, 2,3 are for the three-dimensional internal space (the vector space on which SU (2), 
the covering group of SO(3), is represented). In case the Euclidean GR, in place of 
the Lorentzain GR, the metric signature is given by (+,+,+,-+) and the Lorentz 
group SO(3,1) is replaced by the Euclidean rotational group SO(4). The speed 
of light is set to c = 1, but we keep the (reduced) Planck constant fh and the 
gravitational constant G explicit. 


2. Motivations 


Despite the fact that enormous efforts in various approaches have been devoted 
for QG, it remains the Holy Grail of theocratical physics. In the section, we 
briefly discuss the conceptual problems in QG. (See Refs. 3, 4 and 22 for full 
elaboration.) 


2.1. Why quantum gravity? 


The very first question about QG is: Why do we even bother to quantize gravity at 
all? Apart from many aesthetic considerations for an elegant unification of known 
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fundamental physics, the logical necessity of a quantum description of gravity fol- 
lows from the conflicts between the two fundamental theories of GR and QFT. The 
fundamental theories collide in the classical Einstein field equation 


1 
Ryuv — pF Iu = 81GT ww, (1) 


which relates the dynamics of nongravitational matter (in terms of T),,,) described 
by the theory of QFT to the dynamics of geometry (in terms of g,,,) described by 
the theory of GR. Note that the metric g,, enters the definition of the energy— 
momentum tensor via T,,, = Ty(g) = 6S/dg"” (S is the action) and that QFT is 
defined only on a given (flat or curved) background. 

As matter fields are quantized in QFT, the only way to make sense of Eq. (1) 
without quantizing g,, is to start with a given background go and replace the right- 
hand side with the expectation value ( ian (go)). Once we solve the classical field gj. 
by this prescription, we treat the solution as the new background and iterate the 
procedure. However, the iteration is not guaranteed to yield convergent solutions 
and thus leads to inconsistency. In fact, one cannot consistently couple a classical 
system to a quantum one,*?? even though QFT in curved spacetime provides an 
adequate effective theory in low-energy regime. 

The conflicts force one to quantize both geometry and matter fields simultane- 
ously or to invoke other more radical modifications. 


2.2. Difficulties of quantum gravity 


What after all are the difficulties that prevent one from quantizing both geometry 
and matter easily at the same time? It turns out the difficulties lie at the very 
contradictory difference between the fundamental principles of GR and QFT. 

In GR, spacetime is dynamical as well as matter fields. Matter and geometry are 
essentially on the equal footing in this regard. In QFT, by contrast, all dynamical 
variables are quantized except spacetime. Spacetime is treated as a given fixed back- 
ground, which parametrizes all dynamical fields and provides the a priori causal 
structure needed for field quantization. Therefore, one encounters the stark paradox 
when joining GR and QFT together. 

There are various approaches trying to circumvent the paradox, differing on 
which features of GR and QFT are viewed as fundamental and unchangeable and 
which features as inessential and modifiable, as surveyed in Refs. 1-4. 


2.3. Background-independent approach 


In conventional QFT, dynamics of dynamical fields takes place on the “stage” of 
a fixed spacetime with given metric. By analogy, dynamical fields are animals that 
roam around and chase one another upon the stage of an island of spacetime. 

GR, on the other hand, has taught us a very different paradigm of back- 
ground independence, according to which spacetime is itself a dynamical entity 
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(gravitational field) in many respects the same as other nongravitational entities. 
That said, physical entities, both gravitational and nongravitational, are not resid- 
ing and moving in spacetime but, rather, they reside and move on top of one 
another. In the words of Rovelli’s metaphor®: “It is as if we had observed in the 
ocean many animals living on an island: animals on the island. Then we discover 
that the island itself is in fact a great whale. So the animals are no longer on the 
island, just animals on animals.” 

Among various theories of QG, LQG is special in the sense that it reveres the 
paradigm of background independence earnestly and aims to formulate a noncon- 
ventional QFT of GR in genuine conformity with the paradigm, essentially in a 
nonperturbative fashion, so as to reconcile the aforementioned paradox. This of 
course is a very challenging task and the first step is to begin with the “proper” 
mathematical framework, which turns out to be the canonical description of con- 
nection dynamics of GR in terms of Ashtekar connection. The shift from metric to 
Ashtekar connection opens the possibility of employing nonperturbative techniques 
in gauge theories (especially, lattice gauge theories) to QG. 


3. Connection Theories of General Relativity 


Classical GR is usually presented as geometrodynamics, i.e. a dynamical theory of 
metrics, but it can also be recast as connection dynamics, i.e. a dynamical theory 
of connections. The merit of this reformulation is to cast GR in a language closer to 
that of gauge theories, for which quantization is better understood. In this section, 
we present a brief introduction to connection theories of GR, largely excerpted from 
Sec. 2 of Ref. 8, and address some remarks in Sec. 3.3. For introductory purpose, 
we will focus only on the gravitational part of the action and phase space. For 
inclusion of matter and the cosmological constant, see e.g. Ref. 24 (also see Sec. 7); 
for extension to supergravity and other spacetime dimensions, see e.g. Refs. 25-27 
(also see Sec. 12.14). 


3.1. Connection dynamics 


The most widely known formulation of classical GR is the dynamical theory of met- 
ric Juv, Which is based on the Riemannian geometry and governed by the Einstein— 
Hilbert action 


Sl= sag f tevialR, ) 


where R is the Ricci scalar and g is the determinant of g,,,. Classical GR also admits 
many other formulations, which are equivalent to one another to various extent. 
In the Palatini formulation?®:?° based on the Riemann—Cartan geometry, instead 
of guy, the basic gravitational variables are taken to be the coframe field e,,’, which 
gives the orthonormal cotetrad (or vierbein), and the so(1,3)-valued connection 
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1-form field w,,/ y which corresponds to the gauge group SO(3, 1) of local Lorentz 
transformation. The Palatini action is given by 


1 
Sle, w] = song | erase! Nel AOR, (3) 
where 
Q:=dw+twAw (4) 


is the curvature of the connection age y The coframe a determines the spacetime 
metric via 


Guy = ren’ ty" (5) 
Meanwhile, the connection w,,/ , is completely determined by the coframe via 
de+wAe=0, (6) 


which is the equation of motion obtained by varying the Palatini action with respect 
to w,. As far as equations of motion are concerned, the Palatini action Eq. (3), 
imposed by Eq. (6), reduces to the familiar Einstein—Hilbert action Eq. (2). Thus, 
these two actions lead to the same theory at least at the classical level. 

It is tedious yet straightforward to perform the Legendre transform on the Pala- 
tini action to obtain the Hamiltonian theory,®° but the resulting theory has certain 
second-class constraints. When the second-class constraints are solved, one is led 
to the standard Hamiltonian description of geometrodynamics (in terms of cotri- 
ads) and thus loses all reference to connection dynamics; meanwhile, the form of 
geometrodynamics is rather complicated and seems insurmountable for quantum 
theory. This problem can be avoided by modifying the Palatini action with Holst’s 
augmentation®! as 


1 
Slew] = gaz f exaxne! Ae! AN — fe ne! AM (7) 


where ¥ is an arbitrary but fixed number, called the Barbero-Immirzi parameter. 
The celebrated feature is that this augmentation does not change the equations 
of motion of Eq. (3). Inclusion of Holst’s modification renders it more elegant to 
perform the Legendre transform in the sense that all constraints are first class (as 
we will see shortly) and consequently makes the quantum theory founded on the 
Hamiltonian description more manageable as significance of connection dynamics 
is retained. (While different numerical values of y yield equivalent theories at the 
classical level, it should be noted that, in the quantum theory, they give rise to 
different sectors that are not unitarily equivalent to one another as will be shown 
in Sec. 5.) 

In the non-abelian gauge theory, the Yang—Mills action can also be augmented 
with a topological term called the 6-term as 


1 . 9 
Sao fIrar+s, [PAF (8) 
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where F is the field strength (curvature) of the Yang-Mills connection, * is the 
Hodge dual, and a and @ are known as the coupling constant and the 6-angle respec- 
tively.* It is noticeable that Eq. (7) bears close resemblance to Eq. (8). Whether 
this intriguing resemblance carries any profound meaning arouses a lot of curiosity. 
LQG, in particular, exploits the resemblance to the extent that we can adopt the 
nonperturbative techniques (such as the quantization scheme in terms of Wilson 
loops) well developed in the context of gauge theories. 


3.2. Canonical (Hamiltonian) formulation 


We have reformulated the classical theory of GR in a fashion very close to non- 
abelian gauge theories. Prior to quantizing the theory, we have to take a further step 
to translate the covariant description into the canonical (or Hamiltonian) descrip- 
tion in order to apply the standard scheme of equal-time quantization used in QFT. 

First, we apply the ADM foliation, which foliates the four-dimensional spacetime 
manifold M into a family of spacelike surfaces (called “leaves” ) ¥,, labeled by a time 
coordinate t and with spatial coordinates on each slice given by x’. Under the ADM 
foliation, variables on the spacetime manifold are split into 3+1 decomposition. The 
coframe (cotetrad) e,,/ is split into the cotriad e', plus the lapse function N and 
the shift vector field N°. The lapse and shift describe how each of the leaves 4; are 
welded together in the foliation.” Correspondingly, the four-dimensional metric gj 
of M is split into the three-dimensional metric gay of & plus N and N°. Similarly, 
apart from (w’- t) := —3e'!*w;,, +t, the so(3,1)-valued connection 1-form w/7 on 
M is naturally decomposed into two su(2)-valued 1-forms on ©: T = I. 7jdx* and 


K = K'r,dx%, where 7; = T' = o;/(2i) are the generators of SU(2) with o; being 


the Pauli matrices. As a consequence of Eq. (6), I is the su(2) & so(3) spin 
connection on ©; associated with the cotriad e?,, i.e. 
je +241" Ae” =O, (9) 


and K* is the extrinsic curvature of ©; imbedded in M. The 3+1 splitting is 
summarized in Table 1. 
Performing the Legendre transformation on the Palatini—Holst action Eq. (7), 
we obtain 
1 
81Gy 


eh : ae ; 
ical da BY L.A‘ — h(E, Ai, N, N%, (w*- »)), (10) 
Xe 
where £ is the Lie derivative, and the Hamiltonian density h is given by 
h = (w' -t)G;+N°C, + NC. (11) 
Analogous to the Barbero-Immirze parameter y, different numerical values of 0 yield equivalent 
theories at the classical level but give rise to inequivalent 6-sectors in the quantum theory. 


bMore precisely, let n® be the unit vector normal to the slice N;, we have t® = Nn® + N® with 
N°? ns = 0. 
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Table 1. Decomposition under the ADM foliation. 


M = De 
| hy eee = Di ac 
I,J,...=(0,1,2,3) 7 i, j,... = (1, 2,3) 
s0(3, 1) = 80(3) & su(2) 
eu" = et, N, N@ 
Iv ml dav, N, N® 
wy" y => Tey a ee vt) 
deb +ulzyAe7=0 = de’ + e'5,T* Ac* =0 


The canonical pair (Ai, E;) is given by the Ashtekar connection (1-form) 
Ai (2) = T(z) + ¥Ka(e) (12) 


and the densitized inverse triad (vector density of weight 1) 


~ a1 1; 
i= 5 ebece Cask, (13) 


which gives the three-metric gay via 


with 
q = |det qap| = det E. (15) 


The Hamiltonian density h is given by three (local) constraints G; = 0 (Gauss 
constraint), C, = 0 (diffeomorphism constraint), and C = 0 (scalar constraint) 
associated with the Lagrange multipliers (w' - t), N%, and N, respectively. The 
three constraints are given by 


Gi = DaE; = O.E; + 64" ALE, (16a) 
Gai), — EG, (16b) 

_— _ Yat pl ig. pk ae i qed 

C= 5a i Pies €4,F i +(o0-7¥ )2Kj. Kj 

qd 
2 E; i 
+ 81G(y ares 7)0a Vidi G 7 (16c) 
q 
where F'* is the curvature of the connection A’ and o = —-1.° 


°Had we considered the Euclidean GR instead of the Lorentzain GR, the Lorentz group SO(3, 1) 
would have been replaced by the Euclidean rotational group SO(4). Correspondingly, 7 = +1 and 
Ai :=Tt —oyKi =I, — 7K} for the Euclidean case. 
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The fundamental variables are A‘, and EY. which satisfy the canonical relation 
i rb i 
{Ai (2), Bj(y)} = 8aGy 556207 (a« — y), (17) 


and all three constraints can be expressed in terms of A‘ and ES 4 For any smooth 
su(2)-valued field \ = X'7; on ©, the functional 


Cq[d] := a [ PxriG; (18) 


generates precisely the infinitesimal internal SU(2) gauge rotations by A 
{Ai,Co[]}=—Dar* and {7 ,Co[A} = Gi" Ej. (19) 


Consequently, the Gauss constraint G; = 0 is the “Gauss law” that ensures the 
gauge invariance under the internal SU(2) rotations. Furthermore, removing from 
C® and C a suitable multiple of the Gauss constraint, for any smooth vector field 
N* and scalar field NV, we can show that the functional 


1 hi 
Coie [N] := da(N° EF, — (N° A‘)G; 20 
pial] = ga | da NBL, — (NAIC) (20) 
generates the infinitesimal 3d diffeomorphism (Diff) on © along the direction N 
{A*Cpig[N]} = Ln Ai, and {7 ,Cpig[N]} = CNE;, (21) 
and the functional 
~a ~b 
1 E,E,... a 
= PsN [e4 .F* —7")2Ki Ke 22 
[ | 167G S zt Tal k ot (o a) [a Al ( ) 


generates the infinitesimal translation in time along the direct Nn®: 
{Ai,CIN]} =LwnoAi and {E;,CIN]} = LynoE;, (23) 


where CL is the Lie derivative. Altogether, the three constraints ensure invariance 
under the SU(2) gauge rotation, the diffeomorphism on ©, and the translation in 
time. 

The three constraints are first class in Dirac’s terminology; that is, the Poisson 
bracket of any two of the constraints vanishes in the restricted phase space imposed 
by the three constraints. Explicitly, the Poisson brackets between the constraints, 
which give the so-called “constraint algebra,” read as‘ 


{CalA], Ca[N']} = Coll, 9], (24a) 


4Note that Ki = (PT? — A*)/(oy) and Tt is a non-polynomial function of £7. 

©Tt should be noted that Eqs. (21) and (23) are valid only if A‘ and Ej already satisfy the Gauss 
constraint, because we have removed a multiple of the Gaussian constraint from C® and C. 
fNote that the constraint algebra Eq. (24) is not closed in the usual sense, as the arguments of 
Cpig and Cg on the right-hand side of Eq. (24f) involve the phase space variables Ag and E*. 
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{Ca[A], Cie [N] } = —Ca[LnA], (24b) 
{Ca[A], C[N]} = 0, (24c) 
{Cpise[N], Coie [N’]} = Coie[[N, N’]] = C[CNN’] = -C[Ln/N], (24d) 
{Coit [N], C[M]} = C[LnM], (24e) 

{C[N],C[M]} = o(Coie[S] + Ca[S* Aa) 
ac a Collq\1[B°0,N, B’O,M]], (248) 


where in the last equation the vector field S® is given by 
5B) BS : 
iS? = (NOM — M0O,N)———+ iq l =O q* (NOM = MO,N). (25) 


While the three constraints have to be satisfied, the evolution of the canonical 
pair is dictated by the Hamilton’s equations 

Ai 

Ot 

where the (total) Hamiltonian is H := (87Gy)~' J, d’xh.8 Because all three con- 


={Ai,H} and Ft = (8%, Hy, (26) 


straints are first class, once all of them are satisfied for an initial state of (At, E E'), 
they will continue to be satisfied under the evolution of Eq. (26). The two equations 
of motion in Eq. (26) together with the three constraints G; = 0, C, = 0, C = 0 
are completely equivalent to the Einstein field equation. 


3.3. Remarks on connection theories 


Historically, the Ashtekar connection was not obtained by starting from the 
Palatini-Holst action. Rather, the original idea®* is to replace the Lorentz con- 


nection w,/7 in the Palatini theory with the complex Lorentz connection A,,‘? 
defined as 
1 4 
A 5 (u," =5 Maguay), (27) 


which satisfies the self-dual condition: $7 «,A,,'" = iA,'”. Correspondingly, 
OQ in Eq. (3) is replaced by F/7, which is the curvature of the connection A‘7 
and can also be viewed as the self-dual part of 017, ie. FIT = $017 —tel7 OY". 
It turns out, in terms of the complex connection, all equations in the classical 
theory are simplified dramatically,°? as the new variable manifests the fact that 
s0(3, 1)c is isomorphic to su(2)c @ su(2)c (where gc denotes complexification of g). 


®This Hamiltonian is peculiar and sometimes referred to as “super-Hamiltonian” in the sense that 
it vanishes identically when the three constraints are imposed. The fact that the Hamiltonian 
vanishes is a characteristic of reparametrization-invariant theories. 
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The essence of using the new variable is closely related to the constructions of the 
twistor theory** and the H space theory.” 

However, the complex connection takes values in the Lie algebra of a noncompact 
group, posing an obstacle to constructing the corresponding quantum theory. It is 
Holst?! who first realized that the imaginary number 7 in Eq. (27) can be replaced 
by any (real or complex) parameter y and this prescription corresponds to adding 
the second term in Eq. (7). Since then, most progress toward the quantum theory 
has been made with real y. Taking y = +i in the Lorentzian GR (o = —1) and 

= +1 in the Euclidean GR (a = +1), one encounters the special cases, in which 
the (classical) theory has a richer geometrical structure — particularly, the three 
constraints in Eq. (16) are drastically simplified (note o — y? = 0) and the Poisson 
brackets in Eq. (24) form a closed Lie algebra. 

Rigorously speaking, the formulation we adopted is not a theory of “connection” 
dynamics but of “coframe-connection” dynamics, as we also include the coframe e,,/ 
‘J as the fundamental variables. Connection and 
coframe are the gauge variables in the Poincaré gauge theory of gravity (see Ref. 36 
for a review). If one postulates a Lagrangian quadratic in the gauge variables for 
the Poincaré gauge theory, the most general Lagrangian is given in Eq. (5.13) of 
Ref. 36 


in addition to the connection w,, 


1 
S=——; R+boX —2A 
lonG / (aoR + bo )n 
+12 more terms quadratic in curvature, torsion, and X, (28) 


where R := Rog? is the Ricci scalar, X := FNepy6 RO, A is the cosmological 
constant, and 7 is the volume four-form. The Palatini-Holst action Eq. (7) only 
takes the first line of Eq. (28) with ag = 1, bo) = y~! (and usually A = 0)." From the 
standpoint of the Poincaré gauge theory, it seems rather ad hoc to include Holst’s 
term, but neglect the second line completely, since Holst’s term is equivalent to a 
certain torsion square term via Nieh—Yan identity. Indeed, it would be theoretically 
more compelling if one could construct the quantum theory based on the generic 
Lagrangian Eq. (28), but inclusion of any terms in the second line unfortunately 
makes the constraints in the canonical description very complicated and no longer 
first class, rendering the Hamiltonian formulation inadequate for quantization. The 
fact that the Palatini-Holst action per se is privileged seems to suggest that, even 
with real y, the su(2)c @ su(2)c structure of the complex Lorentz algebra still plays 


hThe Lagrangian with the first two terms ~agR-+ boX was first discussed by Hojman et al.,37 
but the second term ~boX is referred to as Holst’s modification in the literature of LQG, mainly 
because Holst was the first to relate it to the Ashtekar variable. Likewise, the coframe-connection 
formulation of the first term ~aoR is known as the Einstein—Cartan or the Einstein—Cartan— 
Sciama—Kibble theory in the literature of connection theories (see Ref. 36 for a historical account) 
but often referred to as the Palatini formulation in LQG, partly because Palatini’s particular idea 
was underscored in Ashtekar’s seminal paper Ref. 32. Although the eponyms adopted in LQG are 
not historically faithful, we adhere to them in conformity with the LQG convention. 
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a profound role in QG (as advocated by the twistor theory and the H space theory), 
although its direct relevance remains obscure.' 

Finally, it should be noted that in the case of vacuum the Barbero—Immirzi 
parameter y does not appear in the equations of motion and thus have no phys- 
ical effect at the classical level, but this is not true in general. In the presence of 
minimally coupled fermions, the parameter appears in the equations of motion, giv- 
ing rise to a four-fermion interaction.?°> LQG nevertheless takes the Palatini-Holst 
action (with or without matter) as the starting point, adopting the attitude that 
the four-fermion interaction is very likely to be real on account of the fact that the 
effect of gravity on fermions is very difficult to measure. 


4. Quantum Kinematics 


Departing from the canonical description of GR described in Sec. 3, it is time to 
construct the quantum theory. 


4.1. Quantization scheme 


Following the standard strategy for quantization in gauge theories, as A = A’ zj,dx* 
is considered as the connection potential and E= Ee TO, is analogous to the 
SU(2) electric field (note r? = 6%7; = 7;), one would take functionals of A as the 
kinematical quantum states to start with. However, this straight approach does not 
lead us far enough. In LQG, instead of the connection field A(x), we consider the 
holonomy (i.e. Wilson line) h. defined as the path-ordered integral 


hy = Pexp f Ayride* (29) 
y 


over an oriented one-dimensional curve y in ©. Correspondingly, instead of the 
electric field £, we consider the surface integral 


Es 5 := | €abcel, fidx® A da? (30) 
Ss 


over a two-dimensional surface S in © and with a smooth smearing su(2)-valued 
function f(x) = f(x)'7;. The Poisson bracket {hy, Es,¢} can be straightforwardly 
computed by Eq. (17). The bracket vanishes if y and S do not intersect or ¥ lies 
within S. Furthermore, {hy} and {Es,} are closed under the Poisson bracket and 
form the loop algebra. 

The reason why we should take h, instead of A(x) as the fundamental variable 
is essentially because h, transforms more “nicely” than A does under both the 
SU(2) and the 3d diffeomorphism transformations, and consequently it is much 
more manageable to remove the gauge overcounting in the quantum theory. 


‘Accordingly, LQG is most naturally constructed for four-dimensional spacetime. Nevertheless, 
generalization to other dimensions is still possible, depending on what principles of LQG are to 
be upheld. (See Sec. 12.14.) 
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Under the SU(2) gauge transformation, A(x) is transformed via 


A(x) — A'(x) = A(x)A(x) At (x) + A(x)d A(z)", (31) 

while h, is transformed via 
hy hi, = A(aThy Aa?) (32) 
where x) and x} are the initial and final endpoints of y. While Eq. (31) involves 


every point x for A, Eq. (32) involves only the endpoints of y for hy. 
On the other hand, consider a 3d diffeomorphism map y : } — & that is smooth 
and invertible everywhere. Under y, the connection A transforms as a 1-form 


A> A’ = °A, (33) 
where y* is the pullback of vy. It follows that 
hy(A) > hy(A’) = hy(p"A) = hy, (A), (34) 


where yy = (7) is the image of y by y. Compared with Eq. (33) for A, the 
transformation Eq. (34) for hk. is more manageable. 

We will first construct the kinematical Hilbert space H using the functions of 
holonomies, and then implement the three constraints, one after another, to obtain 
the SU(2)-invariant Hilbert space Ha, the SU(2)- and Diff-invariant Hilbert space 
Hiny, and finally the physical Hilbert space Hpnys. The scheme can be summarized 
as 

H a Hea ae Hinv = Hphys: (35) 
The states of 1 will be called cylindrical functions; the states of He called spin 
networks; and the states of Hiny called s-knots. Finally, the physical Hilbert space 
Hphys is supposed to reveal the quantum dynamics. 


4.2. Cylindrical functions 


Let T be an oriented graph, which is defined as a collection of a finite number of 
oriented and (piecewise) smooth edges y, with / = 1,..., 2 embedded in ©. Consider 
a smooth function f of L SU(2) elements. The couple (I, f) defines a functional of 
Aas 
Wr,s[A] = (A| Ur, p) = F(R (A), .- +s Pye (A)). (36) 

These functionals (of A) are called cylindrical functions (of holonomies). Let Cyl 
denote the linear space of cylindrical functions for all T and f. With a suitable 
topology (the detail is not important here), Cyl is dense in the space of all contin- 
uous functionals of A; in this sense, cylindrical functions grasp all information of 
continuous functionals of A. 

If two cylindrical functions are defined for the same graph I’, we can define the 
inner product between them as 


(Wr, | Urq) =] dp +++ dtr Pagan gill) Dah agit, (37) 
SU(2)" 
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where dy; is the Haar measure on SU(2). If two cylindrical functions are defined 
by two different couples (I’, f’) and (”,g”), let P = {y1,.--,92} = {71,---.9,/U 
{y,---,Y{-} be the union of the two graphs IY = {+j,...,7,,} and I” = 
{y,---,7{.}. We can extend the functions f’ and g” to be defined in [ in 
the obvious way as f(hy,,..-,Ry,) = fl (Ryjs---,hy,) and g(hy,,---,hy,) = 
Pinks sug hyw,,)- The inner product Eq. (37) can then be extended to any two 
given cylindrical functions as 


(Ur, p) | Ure or) = (Ur,z | Wry) (38) 


This implements the inner product measure to the space Cyl. We can then define 
the kinematical Hilbert space H as the Cauchy completion of Cyl with respect to 
the norm of the inner product Eq. (38), and the dual space Cyl” as the completion 
of Cyl with respect to the weak topology defined by Eq. (38).) This complete the 
Gelfand triple Cyl C H C Cyl". (See Refs. 39-41 for the rigorous construction.) 

The cylindrical functions with support on a given graph [ form a finite- 
dimensional subspace Hr of H. Obviously, Hp = L?(SU(2)", du") with L being 
the number of edges in T. If © C I’, the Hilbert space Hp is a subspace of the 
Hilbert space Hr. This nested structure is called a projective family of Hilbert 
spaces and H is (and can be defined as) the projective limit of this family.4°4! 
It turns out 7 can be viewed as the space of square integrable functionals in the 
Ashtekar-Lewandowski measure; i.e. H = L?[A,duaz], where A is an extension of 
the space of smooth connections.?9 

The Peter—Wey] theorem states that a basis for the Hilbert space of L? functions 
on SU(2) is given by the matrix elements of the irreducible representations of the 
group. The irreducible representation of SU (2) is labeled by an half-integer number 
j and the matrix elements of the j representation RY are denoted as RY)%g(U) € C 
for U € SU(2). Then a basis of Hp is composed of the states 


ID, 1, a1, Gi) = IT, 41, tae Idi Ob; tae ,ap, G1, tae Bx); (39) 
which are defined via 
(AIT, jr, 01, 61) = ROY g, (hy, (A)) --- RO) g, (hy, (A). (40) 


For each graph IT’, the proper subspace Hp associated with I is the subspace of Hr 
spanned by |T, 7, a1, 4:) with 7; 4 0. The proper subspaces Hp are orthogonal to 
one another and span the whole H; i.e. 


H= DAr. (41) 
rE 


The “null” graph [T = 9 is included in the sum. It corresponds to the one- 
dimensional Hilbert space spanned by the trivial state |) defined as (A|@) = 1. 


JMore precisely, H is the space of the Cauchy sequences |V,,) in Cyl (i.e. || Um — Un|| converges 
to zero); Cyl* is the space of the sequences |W,,) such that (Wp | WY) converges for all |W) € Cyl. 
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An orthonormal basis of H is simply composed of the states |T’, 7;,a1, 3) with all 
graphs [ (including 9) and all spin labels 7; = 1/2,1,3/2,2,... (but j, 4 0). 

Among the basis states, an important special case is that the graph is given 
by a closed curve (or a “loop”) a. As @ has only one edge and no endpoints, the 
corresponding state |T, jz, az, 5;) becomes |a, 7), which is defined as 


(Ala,j) = Tr RY) G exp f A) (42) 


These states |a,j) are called loop states. For 7 = 1/2 particularly, (Ala,j) = 
Trha(A) is the Wilson loop. Similarly, for a “multiloop” [az] = (a1,...,a@n) given 
by a collection of a finite number of loops (without intersection), a multiloop state 
\[az], jr) is defined as 


(A| fou). ie) = T] RO (Pex f | A). (43) 


1=1 
The kinematical Hilbert space H carries a natural representation for both SU (2) 
and Diff groups. Under the SU(2) gauge transformation, the holonomy h, trans- 
forms by Eq. (32) and it follows that the basis states |I, j;,a;, 3) transform by 


IT’, ju, ay, (1) = Tall, ju, aj, G1) 


Pp 
= >, (11 Rey (A(at 1) ROOF 6 se") IT, 32,0, 67). 


at, \l=1 
(44) 


On the other hand, under the Diff transformation, Eq. (34) leads to 
[Yr,7) > Uy|Yr,r) = Yor,s)- (45) 


That is, a cylindrical function Up ,[A] with support on I is sent to a new cylindrical 
function Vy, ¢{[A] with support on the relocated graph yI’. A moment of reflection 
tells that the inner product is Diff-invariant 


(UpUr,7|UpGrg) = (Vor,¢ | Gor) = (Ur,z | Org). (46) 


In fact, the uniqueness theorem tells that the Ashtekar-Lewandowski measure is 
the only unique measure that gives rise to the (*-algebra) representation of the 
kinematical algebra invariant under spatial diffeomorphisms.*? 


4.3. Spin networks 


We are now ready to implement the Gauss constraint upon 1 to obtain the SU (2)- 
invariant space Ha. 

In the space of functionals [A], we promote A(x) to Ai(ax) as the multiplica- 
tive operator: 


(Aj (2)©)[A] = A, (x) ¥[Al, (47) 
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and E as the differential operator 

dU [A] 
5AG(a)’ 
in accordance with Eq. (17). The Gauss constraint Ce [A] = 0 in Eq. (18) is then 
promoted to the operator Cg|)]: 


(Co[\]V)[A] = ¥[A — DA], (49) 


(Eb) [A] := —i8nGhy (48) 


according to Eq. (19). The SU(2)-invariant states are those lying in the kernel of 
Ca] for arbitrary 2. 

In the quantum theory based on cylindrical functions of connections, in place of 
functionals of connections, it is the finite gauge transformation that is of primary 
importance. Accordingly, the space Hg as the kernel of Cg[A] is obtained as the 
invariant subspace of H under finite transformations U,, which act on |W) as 


(UO, ¥)[A] = U[AA AT + Ad At] (50) 


in accordance with Eq. (31). 

For a given graph I’, call the endpoints of the edges “nodes” and assume that, 
without loss of generality, the edges overlap, if they do at all, only at nodes. 
The oriented edges are then also referred to as “links.” Under a finite SU(2) 
transformation U,, the basis states transform according to Eq. (44), which is 
invariant in the subspace Hp and acts only on the nodes of [. At each node 
Un, assume there are ni, “ingoing” links and nou, “outgoing” links. The task is 
to find the SU(2)-invariant (ic. 7 = 0) spin states within the tensor product 
j1 @ +++ Gnin @ 51 Oe — =O0@0@--- at each node v,,. This can be achieved 
by finding appropriate linear superpositions 


|J = 0, M= 0) a > ig” rout Mir Mns, (11,71) @--+ @ ljnins Mrin)) 


4 


®@ (lit, M4) +++ @ [Thins —Mnoued)» (51) 
where i,” ... are called intertwiners and regarded as generalized Clebsch—Gordan 


coefficients, which can be obtained by the standard method of the recoupling the- 
ory*? (also see Appendix A of Ref. 9). 

With every node specified with an intertwiner denoted as i,, an SU (2)-invariant 
state |[, j1,7n) is given by 


N 
28 - G6) ...3(r) : 
Cen) =>, (11 ie Ae [T’, dty Ot, Pr), (52) 


a,,8, \n=1 
where N is the number of nodes, a”) (i =1,..., Mout) is one of a; that is “ingoing” 
to the node v,, and a (i =1,..., in) is one of 3; that is “outgoing” from the node 


Un. See Fig. 1 for a simple example. The states |I’, j;,2n) are called spin networks, 
and choice of 7; and 7, is called the “coloring” of the links and the nodes. Note that 
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ji 


= a4 tg 


j3 
Fig. 1. The left diagram represents the state |I’, j;, a1, 8,) while the right one represents the spin 


networks |I’, jz,in), both for the same simple graph I composed of three oriented links and two 
nodes. The state on the right is a linear superposition of states on the left via the intertwiners. 


the loop states |a, 7) and multiloop states |[a7], j:) are trivial spin networks without 
nodes. 

It is clear that the SU(2)-invariant states of Hr form a proper subspace of Hp, 
which inherits the same inner product structure of Hp. Consequently, we have the 
SU(2)-invariant Hilbert space Hae as the SU(2)-invariant subspace of H and the 
corresponding Gelfand triple Cyl C He C Cylé in the obvious manner. 

See Ref. 44 for the original idea of spin networks and Refs. 45 and 46 for the 
systematic implementation. 


4.4. S-knots 


The next step to implement the second constraint Cpis(N) = 0 for the far more 
crucial invariance: the 3d diffeomorphism invariance. (We will follow the lines of 
Sec. 6.2 in Ref. 8.) To solve the constraint, we adopt the “group averaging pro- 
cedure”: the invariant states are obtained by averaging over the elements of Cyl 
that are transitive to one another by the invariance group.*”4® It turns out the 
Diff-invariant states are not in H but in the extended space Cyl”. 

Given a graph TI’, denote by Diffp the subgroup of Diff that maps I to itself, and 
by TDiffp the subgroup of Diffp that preserves every edge of [I and its orientation. 
It is easy to see that the induced action of TDiffp on Cylp is trivial. The quotient 


Diff 


oor TDiff, 


(53) 


is the group of graph symmetries of T, which permutates the ordering and/or flips 
the orientations of the edges of [. GSp is a finite and discrete group and it induces 
a nontrivial action Csi on Cyl, in the obvious way. 

We construct the general solutions to the diffeomorphism constraint in two steps. 
First, given a state |Up) € Hp, we average it over the group of GS and obtain a 
projection map Poise from Hp to its subspace that is invariant under ree 


. 1 F 
Poinr|Ur) = Ne > U,|Vr), (54) 
yeGSr 
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where Np is the number of GSp and Uy is defined in Eq. (45). The state Poise |Ur) 
is GSp-invariant, as for any o € GSr we have 


Uy Por r|Ur) = => TUelr) = S> Uyroy|¥r) 
P oeGSr yeGSp 
= » Uo" |Vr) => Poia.r|Ur). (55) 


y!-low" €EGSp 


The map Power naturally extends to the projection Poise, which projects 71 to its 
subspace that is invariant under GSp for all T. 

Next, we average over the remaining diffeomorphisms that move the graph I. 
This is a very huge group and the resulting group-averaged states are genuinely 
“distributions” and belong to Cyl* instead of H. For any given state |Wr) € Hr, we 
can define the group-averaged state (7(Wr)| € Cyl* by its linear action on arbitrary 
cylindrical functions |®p-) € Cyl as 


(n(¥r)lér) = S) (Op Poie.r Ur |r), (56) 
pe Diff /Diftr 
where (-|-) is the inner product of H. This action is well defined because only a 
finite number of terms in the sum are nonzero when I’ = yI for some y. It follows 
from Eq. (46) that the state (7(W.,)| is invariant under the action of Diff: 


(n(Wr)|Up, br) = (n(Ur)| Sr’) (57) 


for all y € Diff. The space of these solutions to the diffeomorphism constraint is 
denoted by Cylhi¢, and we have constructed a map 


7: Cyl > Cylpie, (58) 


which sends every element of Cyl, upon group averaging, to a Diff-invariant state 
of Cyliie- 

It should be noted that 7 is not a projection as it maps Cyl onto a different 
space Cyl};. Nevertheless, the group averaging procedure naturally endows Cylpi¢ 
with the inner product 


(n(¥) | n(@)) = ()|®), (59) 
which is well defined as the right-hand side is independent of the specific choice 
of © and ® for the same states (7(W)| and (7(®)|. With respect to this inner 
product, the Hilbert space Hpig of Diff-invariant states is the Cauchy completion 
of Cylp i. Finally, we can obtain the general solutions to both the Gauss and the 
diffeomorphism constraints by simply restricting the starting state |¥) € Cyl to be 
SU (2) invariant, ie. we start with |V) € CylN Ha. The space Cyl;, 
to both the Gauss and diffeomorphism constraints is then given by 


and the Hilbert space Hiny of SU(2)- and Diff-invariant states is the Cauchy com- 
pletion of Cyl; 


inv Of solutions 


mv" 
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What happens to operators? For a given (SU(2) and) diffeomorphism invari- 
ant operator O acting on H (i.e. O commutes with Us: we will see examples of 
diffeomorphism invariant operators in Sec. 5.4), one can define the corresponding 
operator O* acting on Cyliie as 


(O*n(W)|®) := (n(¥)|O'), (61) 


which is well defined in the sense that O* maps from Cylpjq into Cylfig, since 
one can show that, by Eq. (57), (O*n(W)|U,6) = (O*n(¥)|6) provided that Ot 
commutes with oo Furthermore, the operator O* is hermitian in Hpise with respect 
to the inner product Eq. (59) if and only if O is hermitian in H. 

The fact that the (SU(2) and) diffeomorphism invariant Hilbert space can be 
rigorously constructed is very significant, in contrast to the quantization program 
of geometrodynamics using metrics as fundamental variables, in which the precise 
construction remains elusive. 

To understand the structure of Hiny, we consider the action of 7 on the basis 
states of spin networks |S) = |T, j1,in) of He. By Eq. (56), we have 


(n(S’) |n(S)) = (n(S")|S) 
0 if yf AI” for all y € Diff, 


={ S> (80,18) if =I" for some y € Diff, (62) 
gk EGSp 


where Uy, € GSr, and the state U,,|S) is obtained from |S) by changing the order- 
ing and/or the orientations of the edges of . As equivalence classes of unoriented 
graphs under diffeomorphisms are called “knots” and classified by their knotting 
structures of edges, the first line of Eq. (62) tells that two spin networks |S) and 
|S’) in He give rise to two orthogonal states (n($)| and (n(S’)| in Hiny, unless TP 
and I” are knotted in the same way. The states in Hiny are then distinguished by 
the knots, denoted as K, as well the colorings of links and nodes of K. The states 
with the same K but different colorings, however, are not necessarily orthonormal 
to one another, due to the nontrivial action of the discrete symmetry group GSp 
in the second line of Eq. (62). To obtain an orthonormal basis of Hiny, we have 
to further diagonalize the quadratic form defined by the second line of Eq. (62). 
Denote (K, c| the resulting states, where the discrete label c corresponds to the col- 
oring of links and nodes of [ up to the complications caused by the discrete graph 
symmetry group GSp. We call the states (K,c| s-knots and c the coloring of the 
knot K. It should be noted that diagonalizing the quadratic form in Eq. (62) may 
yield degenerate eigenstates, as the projection map Poise defined in Eq. (54) might 
have a nontrivial kernel. As a result, the coloring c for s-knots (K, c| in general has 
less choice than the coloring j;,7,, for spin networks |T, 71, in). 

Knots without nodes have been widely studied in the knot theory, while knots 
with nodes have also been studied but to a lesser extent.495° One peculiarity of 
knots with nodes is that there are graphs with four-valent or higher valent nodes 
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that cannot be mapped to one another by smooth diffeomorphisms.‘ Consequently, 
the knot classes are labeled by continuous parameters (moduli) and therefore the 
space Hiny remains nonseparable as the kinematical Hilbert space H.44°! This prob- 
lem can be resolved by extending the group Diff of smooth diffeomorphisms to the 
group Diff* of extended diffeomorphisms.®? Extended diffeomorphisms are maps 
from © to } that are smooth and invertible everywhere in © except at a finite num- 
ber of isolated points. With Diff replaced by Diff*, the knot classes are discretely 
classified and consequently the Hilbert space Hin, is separable. The “excessive size” 
of in terms of nonseparability turns out to be just a gauge artifact. It is argued 
in Ref. 52 (also see Sec. 6.7 of Ref. 9) that Diff* in place of Diff is in fact more 
natural as the quantum theory is concerned, for diffeomorphisms in Diff are too 
“rigid” in the sense that they leave invariant the linear structures of tangent spaces 
at nodes, which however have no direct physical significance. 

The physical interpretation of the s-knots states will be clear after we define the 
operators. 


5. Operators and Quantum Geometry 


The two fundamental variables in the canonical theory is the connection At and 
its conjugate momentum #.. In the space of functionals of A, the corresponding 
operators A‘ and E% are given by Eqs. (47) and (48), respectively. These two 
operators are however not good operators in the space Cyl of cylindrical functions. 
We will follow the lines of Sec. 6.6 of Ref. 9 to construct the appropriate quantum 
operators. 


5.1. Holonomy operator 


In Cyl, the operators of connections have to be replaced by the operators of 
holonomies defined in Eq. (29). Let (hy), be the matrix elements of the holon- 
omy h,. Then the corresponding operator (hs)\* p acting on |W) € Cyl is simply 
the multiplicative operator defined as 


((hy)* YA] 2= (hy) p(A) VIAL (63) 


The right-hand side is clearly in Cyl. In fact, any cylindrical function is immedi- 
ately well defined as a multiplicative operator in Cyl. For example, the operator S$ 
associated with the spin network |S) acts on another spin network |S’) as 


S|’) =|SUS"), (64) 


kAny 3d diffeomorphism y induces a linear map Jp (pushforward) from the tangent space of a 
given point p to the tangent space of y(p). In order for y to send two nodes into each other, we 
must have Jpvi; = vi, where v;, 7 = 1,...,n, are the n tangent vectors of the links at p and 
vi,i=1,...,n, are the n tangent vectors of the links at y(p). In general, however, there is no 
pushforward map sending n given vectors into n other given directions when n > 4, as the rank 
of Jp is only 3. 
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where |S U $”) is the spin network formed by gluing S and S$’ together in the obvious 
way 


(A|$|S’) = (A|S)(A|S"). (65) 


Furthermore, it is clear that $ is SU(2) invariant (i.e. S$ commutes with U,) and 
leaves the Hilbert space Hq invariant. Spin networks |S) can be constructed as 
the action of S' acting on the trivial state |); ie. $|0) = |S). In this sense, |0) is 
analogous to the Fock vacuum. 


5.2. Area operator 


On the other hand, when acting on cylindrical functions, the operator defined in 
Eq. (48) leads to 


where s is an arbitrary parametrization of the curve y, 7*(s) = dy“(s)/ds is the 
tangent to the curve at the point y(s), and 7 and 72 are the two segments into 
which y is separated by the point y(s). The right-hand side is a two-dimensional 
Dirac distribution (6°(---) is integrated over one dimension) and thus does not 
belong to Cyl. As E¢ is not a well-defined operator in #1, we instead seek the 
desired operator by smearing E over a two-dimensional surface S. In accordance 
with Eq. (30) (with f = 1), we define the operator 


F }6 
EB; = } 1 2 a San fA 
(S) sriGhy [de do“ na(c) 5Al(a(o))’ (67) 
where o = (c!, 0?) are the parametrizations of the surface S, and 
b e 
ate Ste Ox"(a) O2z°(c) (68) 


Oot 0a 
is the one-form normal to the surface S at the point x(a). When acting on 
holonomies, by Eq. (66), F;(S) yields 


E,(S)hy = —8iGhy S > thapr; hyp, (69) 
pEesny 


where p are intersection points (one, many, or none) between the curve y and the 
surface S, 7 and 7 are the two segments of y separated by the point p, and the 
sign + is given by + if the curve y pierces S at p “upwards” to the orientation of S 
and — if “downwards” to the orientation of S. The generalization to an arbitrary 
representation of the holonomy is obvious 


E,(S)R (hy) = -8riGhy J) th,pRO (n)hap. (70) 


pEesny 


Therefore, E;(S) is a well-defined operator on H. 
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The operator E;(S) is not SU(2) invariant, and we cannot obtain an SU(2)- 
invariant operator by simply contracting the index 7 as 


E2(S) := 6 B,(S)E,(S), (71) 
because the SU(2) transformation of E;(S) is complicated by the integral over S. 


Instead, we first partition S into N small surfaces S;, which become smaller and 
smaller as N — oo (for each N, ca S; = S), and then define the area operator 


associated with S as 
= lim Lev EM (Sr), (72) 


N—-o0o 


which corresponds to the classical area of S, 
= | / nab ny Bo da. (73) 
S 


Let us study the action of A(S) on a spin network |P, j1,in), assuming that no 
spin network nodes lies on S. For sufficiently large N, none of S; will contain more 
than one intersection point with [ and the sum over J becomes a sum over the 
intersection points p between S and I’. Consequently, by Eq. (70), we have 


A(S)IP, jtsin) = 87Ghy S > y/dn(ip +1) IV, jt in), (74) 
pEsnr 

where p are the intersection points between S and I, and j, is the color of the link 
that pierces S at p. Since the operator A(S ) is diagonal on spin network states and 
its eigenvalues are real, the operator A(S ) for an arbitrary S is well defined in He 
and is hermitian. For a complete and rigorous construction of the area operator, see 
Refs. 10, 53 and 54. The general result (with the possibility that the spin network 
nodes lie on S) is given by 


A(S)|P,...) =47Ghy S~ /2i8 G8 +L) + 29808 + 1) — GE +1) Poesy. 
pEesnr 


where j>, Lee and a are the colors of the links that emerge upwards (u), downwards 
(d), and tangentially (t) to the surface S, respectively. It is the key result of LQG 
that the spectrum of area is discrete. The smallest nonzero area eigenvalue is given 
by (with 7 = 1/2) 


A = 2V39nGhy = 2V3.07l, (76) 


which is of the order of the Planck area (2, = Gh (assuming ¥ is of the order of 
unity). Intrinsic discreteness of space at the Planck scale has long been expected in 
QG. In the context of LQG, this discreteness is not postulated or imposed by hand 
but rather arises as a direct consequence of the quantization in the same sense that 
the energy spectrum of an harmonic oscillator or of an atom is quantized. Also note 
that different choices of the numerical value of y give rise to nonequivalent quantum 
theories as the difference is reflected in the spectrum of the area operator. 
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5.3. Volume operator 


The two operators S' in Eq. (64) and A(S) in Eq. (72) are in principle sufficient to 
define the quantum theory. To better understand quantum states, we also define the 
volume operator, which plays a key role in the physical interpretation of quantum 
states. 

Consider a three-dimensional region R. The classical volume of FR is given by 


1 7 ma ~b ~c 
V(R) = [ery] sean EB Eyl. (77) 
To construct the quantum counterpart, we first have to regularize the quantity 
1 ~ a ~b ~C 
q€0* E; («)E;(x)E,(x) by the “three-hand holonomy” defined as 


1 sips 
31 epee) Tice) (©) 


Ln LU ane 
SNe (eee ie Oe Pe (amo le OF (78) 
where s, t and r are three points (close to but different from the point x), and 722 
are paths from x to o When s, t, and r are very close to 2, T*°°(a; s,t tr) approx- 


imates Lett Ei (a )E, (x) E,(x). As the “three-hand generalization” of E°(S), for a 
given closed surface, we define 


S) =f to f ao! | Potna(o)ne(a/ynelo") T™(x30,0,0"), (79) 
Ss Ss S 


where x is a point in the interior of S (the exact position of x is irrelevant as we 
will always consider the limit of small S). 

Partition the region R into small cubes R; (for each N, Ue 7 Ri = R) such 
that the coordinate volume of each cube is smaller than €? as N — oo. In the same 
spirit of Eq. (72), we can now define the volume operator associated with R as 


V(R) i= vq im », \/|E3(AR,)| (80) 


Val et 


i Mad Co 8,t,r) = 


where OR, is the boundary surface of the a Rr. 

When the operator V(R) acts on a spin network state |[',...), the three surface 
integrals over OR; in Eq. (79) give three intersection points, as in the case of the 
area operator. For € small enough, the only cubes whose surfaces have at least three 
intersections with the spin network are those containing a node of the spin network. 
Therefore, the sum over cubes I reduces to the sum over the nodes n € TNR, and 
we have 


O (RIE, ...) =e lim > ylW,, 110...) (81) 
V8! ey nePOR 
where 
Wr. SPOR Pa s= YS Teen) (82) 


r,s,tEDndRyz, 
résf£tfr 
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is the action of E3(ORz, ) on |[,...), which is the sum over the triplets (r,s,t) of 
distinct intersections between the spin network and the boundary of the cube Ry,, 
containing the node n. For each triplet (r,s,¢), the result of the action is denoted 
as T(s,t,r)|,...). The key point is that, in the limit ¢ > 0, the operator W;, does 
not change the graph of the spin network, nor the coloring of the links. Its only 
possible action is on the intertwiners. Consequently, we have 


V(RIIL, juin -+-in) = (16rGhy)?/? S> Vi, jist --iw), — (83) 
neInr 


where the coefficients V;,, ‘» can be calculated by the recoupling theory. 

A complete and rigorous construction of the volume operator can be found in 
Refs. 55 and 56. The detailed calculation of Vi, is presented in Refs. 57 and 58, 
where a list of eigenvalues is also given. It turns out that the node must be at least 
four-valent in order to have a nonvanishing volume.*® 

The volume operator is well defined and hermitian with a discrete spectrum 
of non-negative eigenvalues. Furthermore, we can choose a basis of intertwiners 7, 
that diagonalize the matrices V;,, ‘» in Eq. (83) so that the resulting spin networks 
are eigenstates of the area and the volume operators simultaneously. We denote V;,, 
the corresponding eigenvalues. 

In addition to area and volume operators, the length operator can also be defined 
and also yields a discrete spectrum, but it is far more complicated and less under- 


stood than the area and volume operators. 


5.4. Quantum geometry 


The volume operator essentially has contributions only from the nodes of a spin 
network, while the area operator has contributions from the links. Therefore, each 
node represents a quantum of volume and each link represents a quantum of area. 
That a spin network |, 71 ---jz,71...,7) can be interpreted as an ensemble of NV 
quanta of volume, or N “chunks” of space, which are separated from one another by 
the adjacent surfaces of LZ quanta of area. Each chunk of space is located “around” 
the note n with a quantized volume V;,. Two chunks are regarded as adjacent 
to each other if the two corresponding notes are connected by a link j;, which 
corresponds to the adjacent surface with a quantized area 87Ghy,/ji(ji + 1). The 
graph I dictates the adjacency relation among the chunks of space. 

The physical picture is compelling that a spin network state determines a quan- 
tized 3d metric. However, it should be noted that both the area operator A(S) 
and the volume operator V(R) are not diffeomorphism invariant (i.e. they do 
not commute with Ue) as the specification of S or R relies on spatial coordi- 
nates.! Nevertheless, we can specify the surface and region intrinsically on the 
knot of the spin network itself: A “region” is simply as a collection of nodes 


'The only exception is the total volume operator V= V(S), which is diffeomorphism invariant. 
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{ni,n2,..-}K = {n1,...|n; € notes of K} of the knot K associated with a graph 
T; a “surface,” as the boundary of a region, is a collection of knot links each of 
which connects to only one of {n1,n2,...}«. In this way, the volume operator 
V({n,...}) defined as 


0 if [EF] #4, 
Vig) Lesa > Vi, if E)=X, (84) 
} 


{ni,... 


where [I] denotes the knot class represented by IT, and the area operator 
A(O{n1,...}«) defined as 


0 if [[] A K, 
A(O{n1,...$K)|P,...) = S- 8nGhy nla +1) if (]=K, (85) 


HEO{N1,...$K 


are both diffeomorphism invariant. Thanks to diffeomorphism invariance, we can 
define the corresponding operators V*({n1,...}%) and A*(O{n1,...}«) acting on 
Cylpig via Eq. (61). The resulting operators are well defined on s-knot states.™ 
Furthermore, in principle, the eigenvalues of the area and the volume operators 
represent possible outcomes of the corresponding physical measurements.°! 

The situation is precisely the same as that in the classical theory. In classical GR, 
we distinguish between a three-metric gap and a three-geometry [q]; the latter is an 
equivalence class of the former modulo diffeomorphisms. The notion of geometry is 
diffeomorphism invariant while the notion of metric is not. Given a metric gap with 
coordinates x*, we can compute the area of S or the volume of ?, if we define S by a 
map (a!,a7) — #(c) and R by a map (a!, a7, 0°) — 2°(c). However, it makes no 
sense to ask what the area of S or the volume of R is in a given geometry [gq], because 
the coordinates have no significance in the geometry. Instead, given a geometry 
[q], we should specify surfaces or regions intrinsically on the geometry itself. For 
example, given the three-geometry of the solar system, the region and the surface 
of the earth are well defined, without any reference to coordinate localization, and 
it is meaningful to ask what the volume and area of the earth are. In this sense, a 
spin network can be regarded as a quantum (discretized) 3d metric, and an s-knot 
as a quantum 3d geometry. 

The physical interpretation of the s-knots is extremely appealing: They repre- 
sent different quantized 3d geometries, each of which is an abstract aggregate of 
chunks of discrete space separated by discrete adjacent surfaces. S-knots are not 
quantum excitations in space; rather, they are excitations on top of one another, 
as any reference to localization of the chunks and surfaces is dismissed. Also note 
that the “empty” s-knot (@| associated with the trivial spin network |) describes 
a space with no volume and no area at all. We therefore have brought off the 


™Here, we have disregard the technicalities due to the discrete group GSp of graph symmetries. 
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paradigm of background independence as advocated in Sec. 2.3. The manifestation 
of background independence will become even more prominent if we also include 
nongravitational matter fields as will be seen in Sec. 7. 


6. Scalar Constraint and Quantum Dynamics 


We have constructed the SU(2) and diffeomorphism invariant kinematical Hilbert 
space Hiny by imposing the Gauss and diffeomorphism constraints. While the quan- 
tum kinematics is well understood, the crux of the problem in LQG lies in the scalar 
constraint, implementation of which is supposed to reveal the quantum dynamics. 
One might attempt to repeat the group averaging procedure for the scalar con- 
straint as we did for the diffeomorphism constraint, but this strategy turns out 
very difficult because the finite transformations generated by the scalar constraint 
are poorly understood even at the classical level. Instead, we adopt the strategy: 
First, we regularize the classical expression of the scalar constraint; and second, we 
promote the regulated classical constraint to a quantum operator and then remove 
the regulator. (We will follow closely the lines of Sec. 6.3 in Ref. 8.) Since the scalar 
constraint is very intricate, its implementation is far less clean and complete than 
that of the other two constraints. Consequently, the quantum dynamics remains 
a challenging open problem in LQG. Readers are referred to Refs. 62-65 for more 
details. 


6.1. Regulated classical scalar constraint 


The classical scalar constraint is given by Eq. (22). Had we considered the Euclidean 
GR and chosen y = 1, we would have 7? = ¢ = 1 and the second term in Eq. (22) 
would vanish. Therefore, the first term has the interpretation of the scalar constraint 
for the Euclidean GR with y = 1. Accordingly, we rewrite the full Lorentzian 
constraint, i.e. Eq. (22) with o = —1, as 


CIN] = VyC™™[N] —2(1 + 7°) TIN], (86) 
where 
~a~b 
CBMAIIN] 2= 5 a [ae TEE CE (87) 
and 
T(N] := : Bo Ets Ki_K? (88) 
l67G Js det EB eral 


One of the difficulties to deal with C[N] is that it involves nonpolynomial func- 
tions of ES through Vdet E and kK’. Fortunately, the nonpolynomiality can be 
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circumvented by Thiemann’s trick.6*-® First, by the canonical relation Eq. (17), 
we can rewrite the combination for the cotriad e*, 


~b ~c 
€4 = 5 Eabe ek —2 = (89) 
2 Vdet B 
as a manageable Poisson bracket 
1 
: a A V 
eal) = eae (Au@).V), (90) 


where V = V(%) is the total volume of © given by Eq. (77) with R = XU. This 
allows us to express CE"*![N] as 


—saaan | ee) ce" Tr (Fas (2){Ac(z), V}) (91) 


in a form more suitable for loop quantization. 


chen] _ 


Second, we have 


1 _ 
Ki = —{A,(x), K}, 2 
= yy Alo) KY (92) 
where ix is defined as 
K:= : Br Ki E;, (93) 
x 
which can also be expressed as a Poisson bracket 
= 1 a 
7 sap tc nH Vhs (94) 


Therefore, T|N] can be recast as 
2 


aera [Be N(a) eth TH({Aala), K}{Adl 2), K}EAc(@), V9) 
(95) 


We have expressed CE"“![N] and T[N] in terms of A, F and V. The next step is 
to replace A and F' with holonomies, which are the “right” variables to be used for 
loop quantization. For a small path yz, of coordinate length ¢€ starting at x and 
tangent to u, the holonomy hy, , takes the expansion 


Ray, = 1+euAg(x) + O(e?), (96a) 
hy), = hyo = 1—eu"Ag(2) + O(€). (96b) 


T[N] =- 


Similarly, for a rectangular loop @z,u,» with one vertex at x and two sides tangent 
to u and v, each of coordinate length ¢, the holonomy h 


h 


takes the expansion 


Qr,uv 


=1+4 uv Fa(x) + O(e*), (97a) 


An,uwe 


heey Hhgct, = 1— uty’ Fyy(x) + O(e*). (97b) 


Or,uv 
Now, partition © into small cubic cells, edges of which are of coordinate length e. 
Denote by s1, s2, and s3 the edges of an elementary cell O based at a vertex vp, 
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and by (1, (G2, and ($3 the three oriented loops that are the boundaries of the 
three rectangular faces based at vg and orthogonal to s1, s2, and s3, respectively 
(see Fig. 2 in Ref. 8). By Eqs. (96) and (97), we can then regulate Eq. (91) as 
324 CB"'(N), where the contribution from each cell is 


ae N(v ~ 
cE 1(N) = aoe 4 Tr((he, = hg-1)h,-1{hs,,V}); (98) 
I 


and similarly regulate Eq. (95) as )> Za(V), where 


2N(v : 7 
Ta(N) = Gaya Ben other, Ky ha {hs;, K}hya {hex V})- (99) 


Note that we regulate {A, is by eae = V —h7'Vh, instead of simply 
{h,,V} and {A,K} by h>'{h,,K} = K — hy!Kh, instead of simply {h,, K}, 
because we want to keep “the scalar constiaint to be SU(2) invariant after the 
regularization. 

The regularization can be more generic than the above prescription. Instead 
of the simple cubic partition, we can partition © into cells O of arbitrary shape 
(particularly, the partition can be chosen to be a triangulation of /), and in every 
cell O we define edges s7, J = 1,...,ns and loops 3;,7 = 1,...,ng all based at a 
vertex vq inside O, where ns,ng may be different for different cells (see Fig. 3 in 
Ref. 8); furthermore, we can choose an arbitrary representation R of the SU(2) 
group other than R/2), The entire prescription is denoted by R, and called a 
permissible classical regulator if both the following conditions hold 


lim CG ae, (100a) 
lim Tr, = T(N], (100b) 
where 

CRN] = > CES), (101a) 

cB) = —N@o)_ ent (RO (hg,) — RD (hg-)) 

Re 3272G273/2 Bi B; 

me (PM U,) V}), (101b) 
Tr.IN] = )_ Tra), (102a) 

2N(u ~ : - 

Tr.o(N) = in ane >) TVET RO (hy) {R (he;), K} 

IIK 


x R (hea ){R (hs;), KJRO (hea ){R™ (hsx)sV}), (102) 
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and OC’! and T!/* are fixed constants, independent of the scale parameter €. There 
exists a great variety of permissible classical regulators, and this nonuniqueness is 
the source of quantization ambiguities. 


6.2. Quantum scalar constraint 


It is fairly straightforward to promote ce into a quantum operator acting on the 
space #1 of cylindrical functions. Recall that the total volume operator {= V(d) 
and the holonomy operator he are both well defined in H. Therefore, simply by 
replacing V with V, hy, with ee and Poisson brackets with commutators, we obtain 
the operator ce™ on H: 


Cre1N] = D1 CRN), ee) 


+Buc iN(vo) pi 4 4 
Crea (N) = — a5 agasa7ag 2 OP (RY (hp,) — RO (hg-1)) 
i,J 


4 


x RO (h,—1)[R (hs;), V))- (103b) 


Furthermore, because cet is regularized in a way preserving SU(2) invariance, 
Cee is SU(2) invariant and leaves the space Hg of spin networks invariant. When 
acting on a given spin network state |[,...), the part [RD (hs,), V] will yield 
zero unless the segment s intersects a node of I’, because the volume operator has 
contributions only from nodes of the spin network. Also note that while nodes must 
be at least four-valent to have nonvanishing volume, the operator cee can have 
contributions from trivalent nodes, as hs , adds one more edge to the intersection 
node. 

As cea is well defined in #/, here comes the critical issue: We have to ensure 
that the resulting quantum operator is covariant under diffeomorphisms. For this 
purpose, we have to restrict our regularization scheme to a diffeomorphism covariant 
quantum regulator, which is a family of permissible classical regulators that are not 
fixed but transform covariantly as a graph I is moved under diffeomorphisms. The 
precise definition of a diffeomorphism covariant quantum regulator is not important 
here, but the essential point is that such regulators exist and lead to the regulated 
operator Cen that is densely defined in the full Hilbert space 1 with domain Cyl 
and diffeomorphism covariant.®?-% The simplest and most convenient case of such 
regulators can be summarized as follows. For a given graph I, make the partition 
refined enough such that each cell 0 contains at most one node of [. For a cell 
containing a node n, the edges s; are assigned to the proper segments of the links 
of T incident at n; orientations of s; are all assigned to be outgoing at n. The loops 
Gi = Pun = rs are chosen to be the triangular loops spanned by s;7 and sj; the 
loop G77 contains no other points of I except for the edges s;, 57, and its orientation 
is defined as the same as the plane by the ordered pair of s; and s;. See Fig. 2 
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ie 
sedbe: 
SJ CME Sbny, 0 
ERTS if (e; x es) + eK 
THK a+ 50, 
ST m2 >0,<0,=0 


Sk 


Fig. 2. The edges s7,87,s,« and the triangular loop 377 at a trivalent node n. The pattern is 
similar at other multivalent nodes. C!7* and T//* are given by constants in accordance with 
the orientation of ey,e7,eK tangent to s7,57,S5K. 


for the illustration. Finally, the constants C’* = C//* are given by +K1,0, and 
the constants T//* given by +2, 0, depending on the orientation of a triple vector 
tangent to s7,8 7,8 relative to the background orientation of ©, where ki, K2 are 
fixed constants. With this regulator, the sum }>, in Eq. (103a) effectively becomes 
the sum )°,,<p over the nodes of I, and the lapse function N(vg) and the volume 


operator V in Eq. (103b) become N,, and V({n}), respectively, for each node. 
As Ca is well defined in 1, we can define the corresponding operator acting 
on Cyl* via its action on the states of Cyl by 


(CRe TU |®) := (U|CRL") (104) 


for every (W| € Cyl* and |®) € Cyl.” The final step is to remove the regulator and 
obtain the operator C®"* acting on Cyl*. It does not work if one attempts to take 
the limit directly 


(CEvly|6) = (W|lim CRe"'®), (105) 


since cee becomes ill-defined in Cyl in the limit « — 0. Instead, we should define 
the operator cha acting on Cyl* by 


(CB |®) := lim (¥| Cs), (106) 


where the limit is now a limit of a sequence of numbers, instead a sequence of 
operators. The limit exists if (W| is a diffeomorphism invariant state, namely (W| € 
Cyl};¢, a8 we can see in the following crucial observation. Given |®) a spin network 
I[,...), the operator coe on the right-hand side of Eq. (106) modifies |[, . . .) in two 
ways by changing its graph T and its coloring. The graph I is changed by the two 
classes of operators: h,,, and he, ,- The former superimposes an edge of coordinate 
length € to a link of T, and the latter adds the triangular loop as depicted in 
Fig. 2. When « is sufficiently small, changing € in the operator changes the resulting 
state, but the resulting state remains in the same diffeomorphism equivalence class. 
Consequently, the value of (U| CEA) for (Y| € Cyl5ig becomes independent of € 


"To be precise, we should call the operator on the left-hand side oo in the same spirit of 
Eq. (61), but we neglect * and ft to keep the notation simple. ; 
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once € is sufficiently small, and thus the € — 0 limit in Eq. (106) is finite. Therefore, 
we can well define the quantum operator CE" acting on the domain Cylhir, as the 
regulator is removed trivially. 

The Hamiltonian constraint operator, albeit defined upon spin networks with 
reference to coordinates in the first place, turns out to be well defined and indepen- 
dent of the regularization scale € on the domain of diffeomorphism invariant states, 
i.e. s-knots. To sum up, the small e of coordinate length loses its physical signif- 
icance at the diffeomorphism invariant level and the « — 0 limit becomes finite 
because making the regulator smaller cannot modify anything below the Planck 
scale as there is really nothing below the short-scale discreteness.© This striking 
feature as a consequence of the intimate interplay between diffeomorphism invari- 
ance and short-scale discreteness profoundly cures the ultraviolet pathology that 
has long plagued quantum field theory of gravity. 

The term T|N] in the scalar constraint can be dealt with in a completely parallel 
manner. Specifically, we promote Eq. (102) to the operator 


tp.IN] = 32 Tr.0(N), (107a) 


* 2iN(v 
Tr. (NV) = eat 


) > Tee RY (a,-1 )[R® (hie Kr] 
IIK 


x RO (h,-1)[RO (he,), Kn JRO (h, 


s 


MR (hsx),V]), — (107b) 


= 
K 
where Kp, is defined as 


. i 
Kr. = gal 


VCR", (108) 
and in the end define the operator T on the domain Cylpig by 

(TV|®) := lim (|Zp, ®). 109) 

The total quantum scalar operator 

CIN] = /yC™.N] — 2114.77) TN] 110) 

is well defined on the domain Cyl} i¢, namely 


CLN] : Cylpiee @ Cyl”. 111) 


Up to diffeomorphisms, the operator Cr. [N] with the diffeomorphism covariant 
quantum regulator is independent of ¢. That is, for any sufficiently small € and e’, 
given any |®) € Cyl, there is a diffeomorphism y such that 


Cr, [e*N]|®) = Up Cr.[N] Uf). (112) 
It should be noted, however, that the operator C[N] does not leave Cylig 


invariant, because C[N] does not commute with Cpig[N] even at the classical 
level. (Also note that C[N] depends on coordinates via N(x).) More precisely, let 
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(n(Wr)| € Cyliig be a group-averaged state as defined in Eq. (56). As defined 
in Eq. (106), the action of C[N] on the diffeomorphism invariant state (n(Wp)]| is 
given by 

(CIN]n(Wr)|®) := lim (n(Wr)|Cr, [N]®) 


e—0 


lim > (UpPoia,r Ur | Cr, [N]®) 
ye Diff /Diffr 


= lim > (Poi Ur | ul Cr, [N] ®) 
yeDifft /Diffr 


= lim > (Poiz.rUr | Cr, [e7*N] ul ®) 
EDI /Dit'p 


= lm S> eCri le  N)'Poie Ur |®), (113) 
pe Diff /Diffy 


where we have used Eqs. (56) and (112), and the regulator Re = R.(e, y) nontriv- 
ially depends on ¢ and y. It follows from Eq. (113) that 


(CIN]n(¥r)| lim (Up CRnen [oN] Poie Url. (114) 
pe Diff /Diffr 


The resulting state on the right-hand side is a distribution belonging to Cyl* as the 
sum is over the huge group Diff /Diffp, but it is not in Cylp;g as the summands are 
not arranged in the style of group averaging. The quantum theory defined by the 
quantum scalar constraint (C[N]©| = 0 nevertheless is diffeomorphism invariant, 
because what matters is the kernel of C[N], which is a proper subspace of Cyliji¢ 
and independent of N(x). 

On the other hand, C[N] is manifestly SU(2) invariant, as the regularization 
in Sec. 6.1 has been adopted to be SU(2) invariant. Taking into account both the 
Gauss constraint and the diffeomorphism constraint, we have the quantum operator 


C[N] well defined on the domain Cyl;,,,, namely 


CIN]: Gyig = Cyn. (115) 


Inv 


* 


The Cauchy completion of the kernel of C[N], which is a proper subspace of Cyl*,, 
is the sought-after physical Hilbert space Hpnys in Eq. (35). 


6.3. Solutions to the scalar constraint 


The scalar constraint operator C[N] has been rigorously defined on the domain 
Cyl},,,. The next step is to construct the physical Hilbert space Hpnys by solving 


mv" 


the solutions to the scalar constraint (C[N]¥| = 0. 

The actions of the resulting operators CE"! and T on a given SU(2) and dif- 
feomorphism invariant state (7(Wr,j,;)| are rather simple. As indicated in Eq. (114) 
with Eqs. (103) and (107), it turns out that, if |Wpj;) = |, j1,in) contains any 
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extraordinary loops (labeled by R‘&)), CE%! will remove one such loop and T will 
remove two in (n(Wr,;,;)|, in addition to possible changes in the intertwiner at the 
node.® The extraordinary loop is of the type introduced by the regulator at any of 
the nodes as depicted in Fig. 2. The details of the actions of CE"! and T are dis- 
cussed in Ref. 67. (Also see Ref. 68 for the general structure of the scalar constraint.) 
The action of adding or removing an extraordinary loop at a node is schematically 
illustrated in Fig. 3. 

An s-knot state (7(Vr,,j,)| is said to be simple if any spin networks belonging 
to the same diffeomorphism class of (n(Vr,,j.)| do not contain an extraordinary 
loop labeled by RY. Obviously, simple s-knots are annihilated by both CEucl and 
T and thus are solutions to both the Euclidean constraint and the full (Lorentzian) 
scalar constraint. In a sense, these simple solutions are analogues of time-symmetric 
solutions to the classical Hamiltonian constraint. Particularly, s-knot states of loops 
or multiloops are trivial examples of simple solutions. 

More solutions can be obtained by starting from the simple solutions. Let 
(n(bE,.)| be an s-knot obtained from (n(Up? I = (n(Vr,,j.)| by attachment 
of n extraordinary loops labeled by RY (note that there are many ways of the 
attachment), and denote by o 0 the subspace of Cyl}; spanned by those s-knot 
states with n extraordinary loops. The resulting spaces pe 
and have trivial intersection with one another, i.e. 


jo are finite-dimensional 


DE in ” DE hs =0, if To, J0),n 4 (Vo, 59), (116) 
Every (©| € Cylpig can be uniquely decomposed as 
(UY) = SO MHL), where (n(Wi"))| € DE). (117) 
(Tj), 


This unique decomposition enables us to find the solutions to the scalar constraint 
in a systematic way. 
First, for the solutions to the Euclidean constraint, we have 


(WICP'LN] =0 & (n(Wk)|CP*ILN] = 0, for every ([,j) and n. (118) 


That is, (W| is a solution to the Euclidean constraint if and only if its components 
with respect to the decomposition Eq. (117) are all solutions. This property reduces 


Fig. 3. When an extraordinary loop is added or removed at a node, the change is between the 
left diagram and the middle one, which can be understood as the right diagram. Here, we show 
the case that the extraordinary loop is labeled by RG=1/2) | 
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the problem of finding the solutions to the Euclidean scalar constraint to that of 
finding solutions in finite-dimensional subspaces, which is equivalent to the tractable 
task (by a computer) to find the kernel of finite matrices. 

Now, consider the full (Lorentzian) scalar constraint. In the scheme with respect 
to the decomposition Eq. (117), the problem of finding solutions to the full scalar 
constraint is reduced to a hierarchy of steps. That is, the scalar constraint equation 


(WICIN] = V7 (WIEE™"[N] — 201 + ¥7)(WIF IN] = 0 (119) 
is equivalent to the hierarchy of equations 


(n(¥)|T[N] = 0, 


2(1 + 7 )(n(@PITIN] = V7 (n(@P)|CP"1.0)], 
(120) 
2(1 +7? )(n(@EF?)ITIN] = V7 (M(VE2)|CE*[N], 


since CEX<! removes one extraordinary loop and T removes two. This hierarchical 
procedure gives a good control on the solutions and suggests a sound postulation 
that the series should terminate at a finite number of steps. 

Even though we have a good understanding of solutions to the scalar constraint, 
our knowledge about them is far less clear and complete than that about s-knot 
states. Additionally, we encounter various quantization ambiguities, notably the 
choice of RY for the SU(2) representation, which further complicate the solutions. 
It should also be remarked that the scalar constraint operator C [N] is evidently not 
hermitian in the space Hpi, as it removes extraordinary loops but does not add 
them. One can simply obtain the hermitian scalar constraint operator by replacing 
it with (C + Ct) /2, and the hermitian operator is expected to be better behaved for 
some technical issues as well as for some aspects of the classical limit. However, as 
a nonhermitian scalar constraint operator does not lead to any inconsistency, there 
is no logical necessity of being hermitian. Meanwhile, a great number of variant 
approaches of the quantum scalar constraint have been considered in the literature 
(see Sec. 7.4.1 of Ref. 9). 


6.4. Quantum dynamics 


The space of solutions to the quantum scalar constraint is the kernel of C [N], which 
is a proper subspace of Cyl;,,,. Consequently, the physical inner product between 
any two solutions are naturally inherited from the inner product of the space Cyl}, ; 
i.e. for (|, (®| € Kernel of C, the physical inner product is given by 


(U|®) ry. = (Y]®), (121) 


phys 
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where the right-hand side is defined in Eq. (59). With respect to the physical inner 
product (- | -) the Cauchy completion of the kernel of C is the physical Hilbert 
space Hphys- 

Even though Hpnys can been rigorously constructed (at least formally), the 
quantum dynamics in terms of evolution remains very obscure. Given physical state 
(V| € Hpnys, to read out the “evolution” (with respect to the arbitrary coordinate 
time ¢t used in the ADM foliation), one might attempt to designate the quantum 
counterpart of the classical Hamilton’s equation Eq. (26) as 

£ (BLOCH) Wry = SOUI[O), AIWpnoe. (122) 
where the quantum Hamiltonian H := (8Gy)~! (Colw -t)| + Cie [N] +C[N]J), and 
O(t) is the operator of an observable O corresponding to a measurement performed 
at coordinate time t. However, this strategy does not work, because the right-hand 
side simply vanishes as physical states are annihilated by all three constraints. 


phys?’ 


In a sense, the evolution, in the conventional notion, is completely “frozen.” This 
difficulty is closely related to the renowned “problem of time” in QG®”° and has 
entailed the timeless description of quantum mechanics (see Chap. 5 of Ref. 9 for an 
in-depth elaboration). In the literature, there are mainly two strategies to unveil the 
quantum dynamics: the “complete observable approach” versus the “partial observ- 
able approach.”° Complete observables, also referred to as “Dirac observables” or 
“physical observables,” are those that commute with all three constraints; by con- 
trast, partial observables are those that do not commute with all three constraints 
(the scalar constraint in particular). In other words, complete observables are free 
of any gauge ambiguities, while partial observables still have a remnant of gauge 
dependence in the strict sense. (For more on the distinction, see Refs. 71 and 72.) 

In the complete observable approach, instead of O(t), we construct a fam- 
ily of Dirac operators Ole parametrized by ®. The operator Ole is nontrivially 
constructed so that it is well defined in Hpnys and commutes with all constraint 


—~. 


operators for any value of ®. We interpret the operator O|s, as representing a 
measurement © performed at the instance when the observable ® takes the value 
®o.P For a given physical state (U| € Hpnys, the quantum dynamics is portrayed in 
terms of the expectation values 


(| Ola|V) phys, (123) 


which do not describe the evolution of observables with respect to a preferred 
time variable but, instead, the correlation between observables (O and ®). In this 
regard, the variable ® is said to serve as the “internal time” (also known as “internal 


°It should be remarked that there are many various approaches in the literature that are more or 
less in one form or the other of the two main strategies but different in finer detail. The aim here 
is neither to give decisive definitions of these two strategies nor to exhaust all possible approaches 
but to give general ideas about how the problem of quantum dynamics could be tackled. 

PIn the timeless language, a measurement is said to be conducted at some “instance,” rather than 
at some “instant.” 
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clock” ). Unfortunately, it is extremely difficult to construct Dirac operators that are 
nontrivial and physically sound. Usually, one also has to include nongravitational 
matter into the system in order to provide a privileged dynamical reference as 
the internal time.* In some symmetry-reduced theories, such as LQC, the Dirac 
operators can be explicitly formulated and consequently the quantum dynamics 
can be clearly deciphered, as will be shown in Sec. 11. 

The partial observable approach, on the other hand, adopts a very different phi- 
losophy. While it is asserted in the complete observable approach that only complete 
observables are physical, the partial observable approach contends that it would be 
too restrictive to dismiss all partial observables, because one can never grasp “total- 
ity” of the whole system and therefore some gauge degrees of freedom, especially 
those for the reparametrization invariance in time, nevertheless bear physical signif- 
icance as far as one concerns real measurements, which are conducted without the 
knowledge of totality. Unattainability of totality is even more crucial in quantum 
mechanics, as one can never disregard the observer, who is not viewed as a part of 
the system to be observed. By the standard Copenhagen interpretation of quantum 
mechanics, it is impossible to know and even invalid to ask what the physical state 
(| € Hpnys is for the whole world. In this sense, the quantum dynamics described 
in the style of Eq. (123) seems inadequate and heuristic at best. 

In accord with the Copenhagen interpretation, the partial observable approach 
describes the quantum dynamics in terms of the “if-then-what” prediction. Provided 
that a measurement O, performed at some instance yields the outcome a, the 
quantum dynamics is to predict what the probability is for the measurement Og 
performed at another instance to yield the outcome b. The only difference from 
the conventional quantum mechanics is that we do not need to specify the time 
separation between the two measurements, because any notion of time separation 
is in principle encoded in the measurement outcomes a and b themselves in the 
timeless description. (In fact, we do not even need to specify the time-ordering of 
the two measurements. See Sec. 3.5 of Ref. 73 for more discussions. ) 

For LQG, to make sense of the quantum dynamics of spacetime, we naturally 
choose the relevant measurements to be those for the intrinsic areas and volumes as 
studied in Sec. 5.4. Note that the corresponding operators defined in Eqs. (84) and 
(85) correspond to partial observables as opposed to complete observables, as they 
do not commute with the scalar constraint C [N]. Consequently, after a complete 
measurement of the quantum geometry, the kinematical state is collapsed into an 
eigenstate of the quantum geometry, which is an s-knot state (K,c| € Hiny. The 
quantum dynamics of spacetime is then posed as a predictive question: What then 
is the probability for another complete measurement of the quantum geometry to 
yield the outcome associated with the s-knot (K’,c’|? It is natural to postulate 


The strategy of the complete observable approach is closely related to the idea of the reduced 
phase space quantization, in which one finds a one-parameter family of complete observables O|o 
at the classical level before quantization. See Sec. 12.3. 
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that the probability is given by the |W(K,c; K’,c’)|”, where W(K,c; K’,c’) is the 
transition amplitude 


W(K,¢, K',c’) := (K,¢|Pe|K’,) (124) 


and Pz is the “projector” that projects an s-knot state in Hin, into the subspace 
Hphys- More precisely, the projector Pe : Hiny -~ Hphys C Hiny is formally given by 


Pe= > (UL, (125) 


(v | €Hpnys 


where {(| € Hpnys} forms an orthonormal basis of Hpnys. Therefore, the quantum 
dynamics in principle can be inferred from the complete knowledge about the phys- 
ical Hilbert space Hpnys. The transition amplitude given by Eq. (124) bears close 
similarity to that in the spin foam formalism as will be discussed in Sec. 9. 

Both the complete observable approach and the partial observable approach are 
far from fully developed and remain tentative, and our understanding about the 
quantum dynamics of spacetime is still very limited. The obstacles not only lie in 
the mathematical difficulties but are also deeply rooted from the conceptual (and 
even philosophical) riddles of interpreting the fundamental notions of space, time, 
quantum measurements, etc. 


7. Inclusion of Matter Fields 


So far, to bring out the main ideas of LQG, we have ignored nongravitational matter 
fields. Inclusion of matter fields does not require a major revamp of the underlying 
framework. The quantum states of space plus matter naturally extend the notion 
of s-knots with additional degrees of freedom. We will follow the lines of Sec. 7.2 of 
Ref. 9. For more details, see Chap. 12 of Ref. 10 and references therein (and also 
see Chap. 9 of Ref. 6 for the corresponding classical theories). 


7.1. Yang-Mills fields 


The easiest extension is to incorporate Yang-Mills fields. Let Gy) be the Yang— 
Mills group. The two 3d connections, Ashtekar connection A and Yang—Mills con- 
nection Ay, can be considered together as a single connection A = (A, Aym) 
associated with the group SU(2) x Gym. As the holonomies of Aym and surface 
integrals of the Yang-Mills electric field can be defined exactly in the same ways as 
those of A and EF, the construction of H, Ha, and Hiny extends straightforwardly 
to the total connection A without much difficulty. 

The gauge (G) and diffeomorphism (Diff) invariant quantum states of space 
plus Yang-Mills fields are given by the s-knot states that are classified by knotted 
graphs and carry irreducible representations of the group G on the links and the 
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corresponding intertwiners at the nodes. Because G is the direct product of SU(2) 
and Gym, its irreducible representations are simply the products of those of SU(2) 
and Gym. Consequently, the coloring for each link is extended as (j;,k;) and the 
coloring for each nodes is extended as (i, Wn), where k; is the irreducible represen- 
tation of Gym representing the Yang—Mills electric flux passing through the surface 
1, and w, is the intertwiner of Gym representing the Yang-Mills field strength at 
the node n. 


7.2. Fermions 


The next is to include fermions. Let n(x) be a Grassmann-valued fermionic field. 
It transforms as an irreducible representation k under the gauge transformation of 
the Yang-Mills group Gy), and as a fundamental (j = 1/2) representation under 
the SU(2) transformation. It is more convenient in LQG to take the densitized field 
w(x) := Vdet E(x) as the fundamental field variable.” 

As an extension of (36), the cylindrical function of the connection A and the 
fermion field 7 is a smooth (Grassmann-valued) function of L group elements of 
G = SU(2) x Gym and 2N|k| Grassmann variables (|k| is the dimension of k) 
defined as 


Up f[A, Y, vy] = F (ling (A), a slay, (A), 
seat” Cee pax Bia en Ne (126) 


where I is the graph comprised of L edges y, and N nodes located at %p=1,....n, 
M4=1,....|k| © W(k) are indices (i.e. weights) of the representation k, and Mj=1,....|n) € 
w(k) are indices of the conjugate representation k. Since Grassmann variables anti- 
commute with one another, cylindrical functions are at most linear in wf" = W" (ap) 
and wi” = "(a,) for each pair (n,m). If two cylindrical functions are defined for 
the same graph IT’, as an extension of (37), the inner product between them is 
defined as 


(Urs | Ur g) 
L N a 
=f HU (Foeve-+ f avav (Zip.n-+ / aia” ) 
GP a1 mew(k)n=1 
x Faas g hag gO on exe ae PE coe i) 
X G(Ray yey Ryn Pr sees ae TO cece De (127) 


where dy is the Haar measure on G, Zy,y* is a linear transformation that maps 
the integrand into a new (Grassmann-valued) function by assigning y = 0 and 
v* =0, J dydy* is the Grassmann integral over the superspace spanned by ~ and 
w*, and ~™,w are treated as independent Grassmann variables. The extension 
of this inner product to any two cylindrical functions is completely analogous to 
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that in the case of pure gravity. The construction of H, He, and Hiny can then be 
readily repeated. 

The gauge and diffeomorphism invariant quantum states are again given by the 
s-knots with extended colorings. For each node n, in addition to the intertwiner 
(in, Wn), we also have to specify an integer F,, as the degree of the monomial 
in w™(x,), which determines the fermion number in the region of the node, and 
an integer F,, as the degree of the monomial in 7™(x,), which determines the 
antifermion number in the region of the node. 

In the presence of fermions, at each node n, the intertwiner (7,,w,) is not only 
intertwined with the coloring (j;,k;) of the links attached to n but also with the 
(anti)fermion numbers F;,, F;,. This is because fermions carry the 7 = 1/2 represen- 
tation indices of SU(2) and the & representation indices of Gym, which have to be 
properly contracted with the matrix elements of RO (h,,(A)) and R“) (h.,, (Aym)) 
in a way such that the generalized Clebsh—Gordon conditions are satisfied for both 
SU(2) and Gym. In other words, fermions are quanta of “charged” particles, with 
both SU(2) and Gym charges, and the quanta of electric flux labeled by 4; for 
SU(2) and k; for Gym can emerge from and end at the charged particles. (The 
fact that fermions are SU(2) charged implies that they contribute to the spectrum 
of the area operator. See Ref. 75 for more discussions.) In the same regard, both 
the gravitational and Yang-Mills fields are self-sourcing, as they yield colorings for 
both links and nodes. 


7.3. Scalar fields 


Scalar fields can also be included in LQG, but in a less natural manner than those 
for Yang-Mills fields and fermions. 

Let ¢(a) be a scalar field, which transforms as a trivial (j = 0) representation 
under the SU(2) transformation and as an irreducible representation k’ under the 
gauge transformation of the Yang-Mills group Gyy. Thus, @ takes the value in 
the vector space of the k’ representation. The problem is that the k’ representation 
vector space is noncompact with respect to natural Gyy-invariant measures, thus 
rendering the definition of the inner product between cylindrical functions difficult. 
(Fermions do not suffer from this problem, since the Grassmann integral naturally 
gives the appropriate measure.) One way to get around this problem is to assume 
k’ to be the adjoint representation of Gy and replace ¢(a) with the associated 
“point holonomy” U(x) := exp (@(x)), which takes the value of Gyn. and admits 
the natural Haar measure. 

The cylindrical function of A, ¢), and ¢ is a smooth (Grassmann-valued) function 
of L group elements of G = SU(2) x Gym, 2N|k| Grassmann variables, and N group 
elements of Gy), defined as 


Ur s[A, Y, ~, ¢| = fl hep A), as) hyp (A), y™ (x1), ease yeni (xn), 
w OC ae gat ge (ae) 0 sag 8 Cay) (128) 
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The inner product between two cylindrical functions of the same graph is then 
naturally defined as 


L 
(Wr ¢| Ur g) = [Te [., [[ aim 
[=1 


OM n= 1 
IN | —1 
i (Zev + / dey deb *) (Zee.ae- - if din Mn ‘) 
mew(k) n=1 
Xf (Poy yee Py Vet, Wael OPN Uy, ...,UN) 
| a Oe a ea pel U,,...,UN), 


(129) 


where dy; is the Haar measure on G, and du, the Haar measure on Gy). Extension 
to any two cylindrical functions is straightforward as before. 

In the presence of scalar fields, of the resulting s-knot state, the coloring for 
each node n is augmented with an additional integer number S,,, which represents 
the total adjoint charge, or equivalently the total number, of the scalar particles 
in the region of the node. Like fermions, scalar fields carry Gym (adjoint) charge 
and accordingly S,, is coupled with w,,k), F, in the generalized Clebsh—Gordon 
condition for Gy). Unlike fermions, on the other hand, scalar fields are trivial for 
SU(2) and S,, is decoupled from i,,, 7; in the generalized Clebsh—Gordon condition 
for SU(2). 


7.4. S-knots of geometry and matter 


In summary, with inclusion of all kinds of matter (Yang-Mills fields, fermions, 
scalar fields), the gauge and diffeomorphism quantum states are s-knots (K,c| = 
(V, jt,2n; kt, Wn; Fn; Fn, Sn| labeled by the following quantum numbers: 


T: an abstract knotted graph with oriented links / and nodes n. 

ju: an irreducible 7 representation of SU(2) associated with each link J. 
in: an SU(2) intertwiner associated with each node n. 

ki: an irreducible k representation of Gy) associated with each link /. 
Wn: a Gyyy intertwiner associated with each node n. 

F,,, Fy: two integers associated with each node n. 


Sy: an integer associated with each node n. 


These quantum numbers correspond to physical quantities as listed in Table 2 

The physical interpretation of s-knots as discussed in Sec. 5.4 can be directly 
generalized to the s-knots of both geometry and matter in the obvious way. The 
paradigm of background independence is even more remarkable with inclusion of 
matter fields, as geometry and matter fields are truly on the equal footing and live 
on top of one another via their contiguous relations without any reference to a given 
background. 
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Table 2. Physical quantities for the quantum numbers of s-knots. 


Quantum number Physical quantity 
T Adjacency relation of regions n and surfaces | 
jl Area of the surface | 
Vis Volume of the region n 
ky Yang-Mills electric flux through the surface | 
Wn Yang-Mills field strength at the region n 
Fn, Fn Numbers of fermions and antifermions at the region n 
Sn Number (total adjoint charge) of scalars at the region n 


Furthermore, in the presence of nongravitational matter fields, there is no diffi- 
culty to perform the ADM foliation and obtain the three constraints. Particularly, 
the scalar constraint of the classical theory can be regulated in the same fashion 
as that in Sec. 6.1 and then promoted to the quantum operator as in Sec. 6.2. 
Therefore, the quantum theory of dynamics of spacetime plus matter in principle 
can be constructed in the same manner as in Secs. 6.3 and 6.4. 


8. Low-Energy Physics 


The essential premise of LQG is that quantum states are background-independent 
excitations out of nothingness — a concept in direct conflict with the foundation 
of conventional QFTs. This poses the big challenge to LQG to show how the low- 
energy description of conventional QFTs arises from the underlying Planckian world 
in an appropriate sense (of coarse graining). The final resolution is still far out of 
reach, but the research is being undertaken step by step. The low-energy physics 
remains one of the most wanted missing pieces in LQG — not until we figure 
out the low-energy physics of LQG and show its agreement with Einstein’s theory 
of classical GR and compatibility with the QFT of the Standard Model can we 
declare LQG to be an adequate quantum theory of GR, because both classical GR 
and the Standard Model have been intensively tested in the low-energy regime. In 
this section, we present the basic ideas of recovering low-energy physics and refer 
readers to Chap. 11 of Ref. 10 for more systematic treatments and more details. 


8.1. Weave states 


It is impressive that LQG yields discrete spectra of geometry as a natural conse- 
quence of quantization. The eigenstates of geometry, i.e. s-knots, however do not 
look akin at all to the smooth macroscopic spaces (curved or flat) we are familiar 
with. What then is the connection between the discrete quantum geometry and the 
smooth classical geometry? 

Consider a very huge spin network with a very large number of nodes and links, 
each of which is of area or volume in the Planck scale. This is a big lattice of 
Planck-scale lattice size, but it appears as a smooth 3d geometry when probed at 
a macroscopic scale much larger than the Planck scale. This is analogous to the 


II-508 D.-W. Chiou 


fabric of a piece of cloth, which is composed of thousands of granular threads but 
appear smooth at a distance. 

More precisely, given a fixed classical 3d metric qap(x) = bije4,(x)e} (x), it is 
possible to construct a spin network state |S) that approximates the metric at scale 
1 > €p;, much larger than the Planck length @p;. That is, for any region R and 
surface S, we have 


A(S)|S) = (A(S)|e + O(6e1/2))|8), (130a) 
V(R)|S) = (V(R)le + O(6p1/1)) |S), (130b) 


where A(S)|. is the classical area of S defined in Eq. (73) and V(R)|- is the classical 
volume of R defined in Eq. (77) with the given cotriad e',, and O(€p;/1) denotes 
small corrections in ¢p;/1. Such a spin network |S) is called a weave state of the 
metric gap. This definition is given for spin networks but can be easily carried over 
to the diffeomorphism invariant level for s-knots. 

Several weave states have been constructed for flat space, Schwarzschild space, 
space with gravitational waves, and more.”6 It should be noted that Eq. (130) does 
not determine a unique weave state. When all macroscopic physics is taken into 
account, it seems more likely that the proper quantum state for a macroscopic 
geometry is a coherent superposition of the weave state solutions. 


8.2. Loop states versus Fock states 


The basic variables of LQG are holonomies (Wilson loops) of the connection A 
along one-dimensional curves and the fluxs of the conjugate momentum E across 
two-dimensional surfaces. These variables however fail to be well defined in the 
Fock space of excitations of A in the perturbative QFTs. A further inquiry into the 
low-energy physics of LQG then is to understand how the perturbative description 
of quantum excitations in terms of Fock states in conventional QFTs arise as a 
low-energy limit of the nonperturbative theory in terms of loop states. 

These issues have been studied for simple examples. The main effort so far is to 
construct mathematical and conceptual tools that will facilitate a systematic analy- 
sis of quantum fields on semiclassical states of quantum geometry.“”’® Particularly, 
the connection between loop states and Fock states has been studied in detail for 
the Maxwell field’? ®? and linearized gravity®? in Minkowski spacetime. 


8.3. Holomorphic coherent states 


The relation between quantum states and the classical theory is most likely to be 
understood by the techniques of coherent states. Various constructions of coherent 
states have been suggested. Here we briefly describe the formulation of holomorphic 
coherent states®* ®° by following the lines of Ref. 89 for the Euclidean theory. 
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Consider the heat kernel K;(h, ho) on SU(2), which is given by the Peter-Weyl 
expansion: 


K,(h, ho) = S025 + le FO! Tr RO (hho), (131) 
J 
where the positive real number t is the “heat kernel time.” For a given graph T of 
N nodes, a holomorphic coherent state associated with I’ is defined as the SU(2)- 
invariant projection of a product over links of heat kernels 


Ur, oe (hav) = dga Ke, (habs JaHavg; '), (132) 
bstab su(2)® II II b b 
where, to simplify the notation, we use a,b,...=1,...,.N to denote the nodes of T 
and the (ordered) couples ab to denote the (oriented) links.’ The label H,» for each 
link ab is an element of SZ(2,C), which is diffeomorphic to the SU(2) cotangent 
bundle T* SU (2) = SU(2) x su(2)* = SU(2) x su(2). 
The state given in Eq. (132) is analogous to the standard wave packet of non- 
relativistic quantum theory 
1 (w@=29)? i 
= —~T52— en Po(t—20) 
WVxo,p0,0 = (Qno2)1/4 e. do2 enPo 0) (133) 
which is peaked in the position x9 as well as in the momentum po and can be 
written (up to a constant phase) as a Gaussian function 


WXo,0 — 


1 = (w7-Xo)? 134 
Sara” doe 
(Q702)1/4 e (134) 
peaked on a complex position Xo := %o + 2ic7h~1po. As K;(h, Ho) is the analytic 
continuation to SL(2,C) of the heat kernel K;,(h,ho) on SU(2) and t = 0 reduces 
Ki(h, ho) to a delta function, H,, is analogous to Xo (while hay analogous to 29) 
and tay is analogous to o. 
As S'L(2,C) is diffeomorphic to SU(2) x su(2), we can decompose each SL(2, C) 
element as 
fishes, (135) 
where hay € SU(2) and Lay, € su(2). Alternatively, H,» can be written in the form 
Hap = Nab e Eartinas) Bet (136) 


where Nab, Mba € SU(2) are two unrelated group elements of SU(2), € € [0, 27) is 
an angle, and 7, € R* is a positive real numbers. Any element n of SU(2) can be 
associated with a unit vector n € R? via 


n:= RO=)(n) -z, (137) 
where z = (0,0,1) € R° and RV=))(n) acts as a rotation matrix. Consequently, 


for each link | = ab, we have four labels of two unit vectors, one angle, and one 


This notation scheme is well-suited for complete graphs, but generalization to arbitrary (non- 
complete) graphs is obvious. 
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real number: (njv), M¢(1), 1, M1) = (Mad, Mba; €ad, Nad), where i(1) and f(/) denote the 
initial and final endpoints (nodes) of the link J. 

Therefore, apart from the labels t,, (its meaning will be clarified shortly), the 
state given in Eq. (132) associated with a spin network graph T is labeled by a 
number 7 and an angle € for each link J, and for each node a set of unit vectors 
n, one for each link at that node. These variables together admit a geometric 
interpretation of a simplicial three-complex as follows. The graph [ is assumed to 
be dual to a simplicial decomposition of the spatial manifold, as each node is dual 
to a 3-simplex (chunk of space) and each link is dual to a face of two adjacent 
simplices. The vectors n’s at a node are outgoing unit vectors normal to the faces 
of the simplex dual to the node,* and the positive parameter 7 for a link / is 
related to a spin j), which is the average of the area of the face dual to the link J. 
Furthermore, the simplicial extrinsic curvature is specified by an angle for each face 
of two adjacent simplices and is identified by &;. To sum up, the state in Eq. (132) 
is specified by both an intrinsic geometry (labeled by n and j?) and an extrinsic 
geometry (labeled by €;) for a simplicial three-complex due to L. 

To see that the holomorphic coherent states represent semiclassical states at 
some appropriate large-scale limit, we study their asymptotics for large 7. Using 
the asymptotic formula 


Ras) (e~HEortinas)on/2) zg eHEatder Medio] jn5, + jan)(jabs +Jab] (138) 


for Nap — 00, it was shown in Ref. 89 that the holomorphic coherent state in 
Eq. (132) is given by the superposition 


WY Hasta (hap) od ‘> (11 (2300 + 1) C50 £an,0, v)) VT jan. Pa (ngs) (Rad), (139) 


Jab ab 


where €;0 ¢,50(j) is a Gaussian function multiplied by a phase: 


: 0\2 
; JTJ —itj 
C70, €,09 (7) := exp (-43*) € i) (140) 
with 
i 1 
(27°, +1) := a a and 0°, := : (141) 
tab 2tap 


Here, ®,(nap) is the coherent intertwiner introduced in Ref. 90, which is a linear 
superposition of an orthonormal intertwiner basis er given by 


®, (Nav) i ie = 2, Pl (Hap) (142) 


my 


with 


®;, (Mav) = Via * we jabs Mav) ; (143) 


‘Note that in general ngy # —Npq. The difference between ng, and —np, encodes connections of 
the intrinsic geometry. 
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where |jap, Map) >= R42) (nav) Jab; +jab)- The states WP jan,Pa (nas) ave Spin networks 
with coherent intertwiners 


Wr jap,a (nar) Rad) = >> (11 i, ) Vr javsia (Rab). (144) 


ta 


The asymptotic states given by Eq. (139) are exactly the boundary semiclassical 
states used in Ref. 91 and the coefficients cjo ¢,0(j) in Eq. (140) are the same as 
proposed in Ref. 92, which demand that the states are peaked both on the area 
(j°) and on the extrinsic angle (€) with appropriate spread widths (specified by 
o° = 1/(2t) = (j°)* with k < 2). These states represent the boundaries of Euclidean 
4-geometries and are used to compute transition amplitudes for given boundary 
states in the Euclidean theory (see Sec. 12.6). The analysis of graviton propagators 
provides a methodology for studying the low-energy physics, and particularly the 
comparison to the classical theory of Euclidean GR confirms that the value € is 
fixed by € = ycos~!(—1/4). The construction of holomorphic coherent states for 
the Lorentzian theory however is much less established and demands more works 
of further research. 


9. Spin Foam Theory 


Conventional QFTs admit two different formulations: the canonical (Hamiltonian) 
formalism and the sum-over-histories (path integral) formalism. So far we have 
concentrated on the canonical formulation of LQG. It is time to discuss the “spin 
foam” theory, which can be viewed as the covariant approach of LQG, alternative to 
the canonical approach. While the general structure of spin foam models matches 
nicely with the canonical theory of LQG, the precise relation between these two 
formalisms however is still not entirely clear and remains a key topic in current 
research (see Sec. 12.5). Up to now, the canonical theory of LQG and the theory of 
spin foams should be regarded as two closely related but independent approaches 
of QG. In this section, we present the basic ideas of the spin foam theory and 
refer readers to Refs. 13-17 for detailed construction of spin foam models. Also see 
Chap. 9 of Ref. 9 and Chap. 14 of Ref. 10 for shorter accounts. 


9.1. From s-knots to spin foams 


a 


Spin foams can be regarded as the “worldsurfaces” swept out by s-knots traveling 
and transmuting in time. A spin foam represents a quantized spacetime, in the same 
sense that an s-knot represents a quantized space. 

Consider the transition amplitude from one s-knot (k’| to another (k| 


W(k, k’) := (k|Pe|k’) (145) 
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as given in Eq. (124). The operator Pe projects s-knots into the kernel of C[N]. 
Heuristically, we can write the projector as 


D1: —CIN]t _ e~ Bt 
Po = jim e =e 2s (E| = 23 |E)(E (146) 


assuming the operator C [N] has a non-negative spectrum of F.' Consequently, 
W(k,k’) = Jim (kle~CN IER). (147) 


In the same fashion of the path integral in quantum mechanics, inserting resolutions 
of the identity 1 = >>, |k)(k|, we can expand the above expression as a product of 
small time-step evolutions 

W(k,k’) = lim tim S* (he EMA ey) (Ras eC" ea_1) 


t—oo M—- oo 
Lyoko 


- (koje CWNI4# ey (ky Je CAR’), (148) 


where At = t/M. For a fixed t, we can always make M big enough such that At 
smaller than any given number. When At is sufficiently small, we have 


(k alee [NJAC ) Oknttskn At N(v)(kn41| C[1]|kn); (149) 


where (kn41|C[N]|kn) = N(v)(kn41|C[1]|kn) is nonvanishing only between two s- 
knots ky, kyn+1 that differ by the action of C at a node v. As discussed in Sec. 6.3, 
the action of C is to add or remove one or two extraordinary loops and also modify 
the colorings. Therefore, while 4d;,,,,,4,, keeps kn41 to be the same as ky, the action 
of (kn+1{C[N]|kn) is on the nodes of s-knots as schematically illustrated in Fig. 3. 
Consequently, for k 4 k’, Eq. (148) leads to 


Co 


Wk.) = Jim SU(-At)**t ST (ICL ev) (Kav [CU] kv—1) 
N=0 Ky gesns kn 
kn4itkn 


- (ko |C[1]|kx) (kx |C[1]|k’), (150) 


where we have absorbed the factor N(v) into At and also identify k’ = ko and 
k = kn 41 for shorthand. 

Now, recall that in Sec. 6.2, as a consequence of the intimate interplay 
between 3d diffeomorphism invariance and 3d short-scale discreteness, the value 
of (W| cos) in Eq. (106) turns out to be independent of € once e is sufficiently 


tAs commented in Sec. 6.2, the projector Pe is independent of the choice of N(x), but we have 
to assume hermiticity and positive definiteness of CIN] in Eq. (146). Hermiticity can be easily 
prescribed as discussed in the last paragraph in Sec. 6.3. Positive definiteness, however, is not 
guaranteed, but heuristically we can replace the local scalar constraint C(x) with \/C(x)? before 
regularization and quantization to yield positive definiteness. This ae tweak in fact makes 
good sense from the standpoint of the Master constraint program, which will be outlined in 
Sec. 12.1. 
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small and consequently the regulator for « — 0 can be removed. The regulator At 
on the right-hand side of Eq. (150) is analogous to € in the sense that reparametriza- 
tion invariance in time now plays the role of 3d diffeomorphism invariance. That is, 
as long as At is small enough, making it smaller will not change anything further. 
In addition to discreteness of space at the Planck length scale, let us postulate also 
discreteness of time at the Planck time scale. It is then suggested, somehow by the 
interplay between reparametrization invariance and short-scale discreteness in time, 
that the dependence on At in Eq. (150) should go away as At becomes sufficiently 
small and consequently the regulator for At — 0 should be removed." 
By removing the regulator At, Eq. (150) then leads to 


W(k,k’)= So w(N)S> AC™) = So w(N(c))A(), (151) 
N=0 o(N) ae 


where w(N) is a weight factor for different N arising from (—At)%*!, and 


o®) := (k = knai,kw,kw_-1,...,41,k' = ko) with kni1 #4 kn is a discrete 
sequence of s-knots, which represents a “history” from k’ to k with N +1 times 
of intermediate state change. The amplitude associated with a particular history 
o = (k,-++, ky4i,ky,-++,k’) is given by 


A(o) = TJ A0(0), (152) 


where v labels the steps of the history, and A,(c) is determined by 
A,(o) ~ (ku+| ClL]Ihe) (153) 


up to a constant proportional factor that can be absorbed into rescaling of At. 
In Eq. (151), the transition amplitude is cast in a sum-over-histories formulation, 
which is not a functional integral over continuous histories of fields but a sum over 
discrete histories of s-knots. 

A history o = (k,...,kv41,ky,...,k’), kn4i 4 kn, is called a spin foam. More 
precisely, imagine that a graph of an s-knot in an abstract 4d space moves upward 
along the “time” direction and the graph is changed by branching of its edges 
at each step under the action of C, the worldsheet swept out by the moving and 
changing graph is the spin foam. Call “faces” and denote by f the worldsurfaces of 
the links of the graph; call “edges” and denote by e the worldlines of the nodes of 
the graph; and call “vertices” and denote by v the points at which the edges branch. 
Figure 4 depicts a vertex of a spin foam corresponding to the action in Fig. 3, and 
Fig. 5 shows a simple examples of spin foams. 


“The short-scale discreteness in time is only postulated. Whether the canonical theory of LQG 
gives rise to temporal discreteness is unknown. The suggestion is only heuristic. Rigorously, to 
make sense of it, we might have to reformulate the regulator R_ in Sec. 6.2 such that it is intricately 
matched with the lapse function N in a 4d diffeomorphism covariant manner. 
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Fig. 4. Left: A typical spin foam vertex (colorings of adjacent edges and faces are not indicated). 
Right: Time-slicing (foliation) of the left diagram gives the evolution of s-knots (time is chosen to 
be vertically upward). The trivalent node on the bottom is branched into three trivalent nodes on 
the top. 


Fig. 5. Left: A simple spin foam composed of 2 vertices, 6 edges, and 6 faces (3 faces inside 
the cylinder are shaded; the other 3 on the surface of the cylinder are not). Its top and bottom 
boundaries, (k| and (k’|, are given by two 6-shaped s-knots. Middle: Time-slicing of the left spin 
foam shows the evolution from (k’| to (k| through an intermediate s-knot state. Right: The left 
and right diagrams overlapped for better visualization. 


The combinatorial object defined by the collection and the adjacency relation of 
faces f, edges e, and vertices v is called a “two-complex” and denoted by I. As an s- 
knot is identified not only by its graph but also the colorings of its links (irreducible 
representations) and nodes (intertwiners), accordingly a spin form denoted by 


o = (L,jpie) (154) 


is determined by a two-complex I’, the coloring of irreducible representations j + 
associated with faces f, and the coloring of intertwiners 7. associated with edges e. 


9.2. Spin foam formalism 


The discussion in the previous subsection motivates a sum-over-histories formula- 
tion of QG. For a given spin foam o = (I, j,i), it is natural to implement A,(o) 
in Eq. (153) as a function A,(j;,i-), called the vertex amplitude associated with 
the vertex v, where jf and 7. are the colorings for the faces and edges adjacent to 
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v. Consequently, Eqs. (151) and (152) lead to the formal spin foam expression of 
the amplitude transition 


Wk) = S> w(C(o)) DT] AGiried, (155) 


Oo0=kUk’ Jfrte v 


where 00 = k Uk’ indicates that the boundary (i.e. initial and final s-knots) of 7 
is given by the union of k’ and k, and w(T) is a weight factor that depends only 
on the two-complex. The weight factor w(I) is a generalized form of w(N), where 
N = N(T) is the number of vertices of I. 

It is often more convenient to recast Eq. (155) in the extended form 


Wik, k’) = ey w(T(o)) a II As (is) [[4-G:: te) [[ 4G: te), (156) 


Oo=kUk! Yprte. of 


where Ay and Ae are the amplitudes associated with faces and edges, which in 
principle can be absorbed into a redefinition of A,,. For most models in the literature, 
A f(jp) is simply given by the dimension dim(jy) of the irreducible representation 
jf. Consequently the transition amplitude is given in the form 


W (k, k’) = > w(P(o)) > [[ dims) [] 4c (ir, te) [] Avr, te), (157) 


Oa=kUk’ Jfste f be 


and accordingly the partition function is given by 


Z= » w(T(o)) + II dim(js) II Ae Gir, te) II Av (f, te). (158) 
o f e v 


Uf whe 
A spin foam model is then formally defined by the specification of 


(i) A category of two-complexes T and the weight factor w(T). 
(ii) A Lie (or deformed Lie) group and associated irreducible representations j 
and intertwiners Ze. 
(iii) Functions of the vertex amplitude A,(jy, i.) and the edge amplitude Ag (je, ie). 


There are various spin foam models that have been elaborated.'4!" They pro- 
vide tentative quantum theories of 3d GR, 4d BF theories, 4d Euclidean GR, etc. A 
remarkable feature is that many very different models all coincide with the form of 
Eq. (158), suggesting that this expression can be viewed as a general background- 
independent formalism of covariant QFTs (also see Sec. 12.6). 


10. Black Hole Thermodynamics 


From theorems proved by Hawking et al.,9*:°* which reveal a remarkable resem- 
blance between a set of laws obeyed by black holes and the principles of ther- 
modynamics, Bekenstein suggested that a Schwarzschild black hole should carry an 
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entropy proportional to the area of its event horizon divided by the Planck area®® °” 
A 
Spu = ake, 159 
BH = 4B GF (159) 


where kg is Boltzmann’s constant, a is a proportional constant, and A = 47(2G.M)? 
is the area of the event horizon with M being the mass of the black hole. By the 
standard thermodynamical relation T~' = dS/dE = dS/dM, this implies that the 
black hole has a finite temperature 


h 
T = ——_—.. 
a32rkpGM 


Shortly after Bekenstein’s conjecture, by studying QFT in a gravitational back- 
ground, Hawking showed that black holes emit thermal Hawking radiation®®-°9 
corresponding to the Hawking temperature 


h 
T = —___ 
8rkpGM’ 


which tightly confirms Bekenstein’s conjecture and fixes the constant a = 1/4. 
Equation (159) with a = 1/4 is often referred to as the Bekenstein—Hawking formula 


Spu = — (162) 
where the subscript “BH” coincidentally stands for both “black hole” and 
“Bekenstein—Hawking” . 

In the modern interpretation of entropy in statistical mechanics, entropy 
(divided by kg) is defined as the logarithm of the number of admissible microstates 
for a given macroscopic state. What then are the microscopic degrees of freedom 
responsible for the black hole entropy? Can we derive Eq. (162) from first princi- 
ples? As h enters in Eq. (162), answers to these questions require a quantum theory 
of gravity. In LQG, a detailed description of black hole thermodynamics has become 
an active direction of research. One of the major achievements of LQG is to derive 
the Bekenstein—Hawking formula from the first principles for the Schwarzschild and 
other black holes.100 108 

In Secs. 10.1 and 10.2, we present the basic ideas, following Refs. 109 and 110 
(also see Sec. 8.2 of Ref. 9). Readers are referred to Chap. 15 of Ref. 10 for a rigorous 
treatment and Sec. 8 of Ref. 8 for a shorter account. More recent advances in the 
black hole entropy can be found in the excellent review paper Ref. 111. We mention 
some of the important results in Sec. 10.3. 


(160) 


(161) 


10.1. Statistical ensemble 


The microscopic degrees of freedom responsible for the black hole entropy Spry are 
those that can participate in the energy exchanges with the exterior. Because, by 
definition of black holes, the states of gravity and matter inside a black hole have no 
effect on the exterior, the microstates of the interior of the black hole are irrelevant 
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for Sgy measured from the exterior.” Observed from the exterior, therefore, the 
entropy of the black hole (for its thermal interaction with the surrounding exterior) 
is completely determined by the geometry of the event horizon. 

More precisely, the microcanonical ensemble is composed of the microstates of 
the horizon geometry that give rise to an exactly specified total area. Denote by 
N(A) the number of the microcanonical ensemble of area A, the quantity S(A) = 
kp ln N(A) is then the entropy for the black hole as far as its thermal interaction 
with the exterior is concerned. 

Given any arbitrary surface (say, not a horizon of any kind), we can of course 
ask how many microstates of the surface correspond to a specific area, but the 
number of the microstates usually has no thermodynamical significance. As heat or 
information can flow across a surface without changing its geometry, the geometry of 
a surface and thus the number of its microstates in general have nothing to do with 
the heat exchange or the notion of entropy. As originally argued by Jacobson,!!? 
it is for any causal horizon that the heat flow always accompanies a stress—energy 
tensor that distorts the horizon geometry and consequently the entropy measured 
by the observer who is separated from the system by the causal barrier is given by 
the geometry of the causal horizon (see also Refs. 113-115, in the context of LQG). 
The black hole entropy given by the geometry of the event horizon is a special case 
of Jacobson’s argument. 


10.2. Bekenstein—Hawking entropy 


Let us now compute the number N(A) from the quantum geometry of LQG. First, 
consider that the quantum state of the geometry of a 3d ADM leaf }; is given by 
an spin network state |s). The horizon is a 2d surface S imbedded in ¥; and its 
geometry is determined by its intersections with the spin network |s). As we are 
interested in the geometry as probed from the exterior, the surface S we consider 
is, more rigorously, the one that is outside but infinitesimally close to the horizon. 
Therefore, we exclude the possibility that the spin network nodes lie on S and, 
instead of Eq. (75), the area of S is simply given by Eq. (74) 


A(S) = 8mGhy > Vaihi + 2), (163) 


where j1,...,Jn are the colorings of the links intersecting the surface S. 

For a given A, how many admissible microstates of the horizon are there then? 
From the perspective of an external observer, an admissible microstate corresponds 
to a possible choice of the sequence j1,...,jn that gives rise to A via Eq. (163) 
and a possible way of “ending” the links to the exterior. A possible “end” of a link 
with coloring j is simply a vector in the vector space of the j representation, which 
has a (27 + 1)-fold multiplicity. To sum up, the admissible microstates are obtained 


‘For instance, in the Kruskal extension of a Schwarzschild black hole, the region IIT may contain 
billions of galaxies, but these do not bear any detectable consequences for us in the region I. 
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by considering all sequences 71,...,jn that give the area A, and for each sequence 
there is a multiplicity of [],(2j; + 1) possible states. 

For a large A much greater than the Planck area (7, = Gh, we assume that 
the number of admissible microstates is dominated by the case of 7; = 1/2 for all 
intersecting links (we will come back to this assumption shortly). In this case, the 
area of a single link is given by 


and thus the number of intersections is 


A A 
Aj=1/2  AnV/3 Ghiy 


= 


(165) 


Each j = 1/2 link has a multiplicity of (2/+1) = 2, so the total number of admissible 
microstates for a given large A is given by 


N(A) = 2" = 24am, (166) 


Consequently, we obtain the entropy of the black hole 


_i m2) A 
~ deal GH 


which agrees perfectly with the Bekenstein-Hawking entropy Eq. (162) provided 
that the Barbero-Immirzi parameter is fixed at the value 


Spy = kp In N(A) (167) 


In2 
y= —< = 0.127384... (168) 


Ee 
The Bekenstein—Hawking entropy can be calculated for different kinds of black holes 
by LQG. The striking feature is that they all lead to the same value of y and thus 
ensure the consistency of the black hole thermodynamics under the framework of 
LQG. 

We have made the assumption that the leading contribution to the entropy 
comes entirely from the lowest states (7; = 1/2), as had been suggested in Ref. 102 
and considered true for several years. The detailed computation of Refs. 116 and 117 
based on a practical rephrasing of the combinatorial problem pointed out that in 
fact all quantum states (not only the lowest ones) must be taken into account. This 
leads to a revision from Eq. (168) for the value of y, which can be numerically 
calculated at arbitrary accuracy!!” as 


+) =m = 0.23753295796592.... (169) 


The simple calculation presented above on the flawed assumption nevertheless 
grasps the conceptual essence of relating the number of microstates to the black 
hole entropy. 
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10.3. More on black hole entropy 


Section 10.2 gives a simple account for the black hole entropy from first principles 
of LQG. A more rigorous approach is the modeling of black holes in LQG by using 
isolated horizons as inner boundaries.'°° 1°? Quantization of the degrees of freedom 
for the isolated horizon can be understood as a Chern—Simons theory. Obtained by 
tensoring the Chern—Simons boundary Hilbert space and the LQG bulk Hilbert 
space, the Hilbert space of the resulting model provides the groundwork to address 
the statistical problems of black holes. 

The combinatorial problem associated with the computation of the black hole 
entropy can be rephrased in a more manageable way ,!!6!1" 
the asymptotic behavior of the black hole entropy in the limit of large horizon area. 
For A > @},, the computation gives an exact formula to the subleading order as 


tie —ym Aol, A, Ay? 
fe SNA) 7 or sng +o((4 ; (170) 


where ym is a constant, whose numerical value is given by Eq. (169). This formula 
agrees with Eq. (159) on the proportionality to A in the leading order and also 
obtains the precise subleading order correction as a logarithmic function of A. 
Comparison to the Bekenstein—Hawking formula in Eq. (162) fixes the Barbero— 
Immirzi parameter y to be 7 = yu. 

For small areas, the precise number counting was first suggested in Refs. 118 
and 119 and later has been thoroughly investigated by employing number- 
theoretical and combinatorial methods (see Ref. 120 for a detailed account and 
Ref. 111 for a review). The key discovery is that, for microscopic black holes, 
the so-called black hole degeneracy spectrum when plotted as a function of the 
area exhibits a persistent “periodicity” (more precisely, a modulation with a regu- 


allowing one to compute 


lar period of growing magnitude). This produces an effectively evenly-spaced area 
spectrum, despite the fact that the area spectrum in LQG is not evenly spaced, 
and makes contact in a nontrivial way with the evenly-spaced black hole horizon 
area spectrum predicted by Bekenstein and Mukhanov under general conditions.!?! 
The periodicity in the degeneracy spectrum leads to a striking “staircase” structure 
with regular steps when the black hole entropy is plotted as a function of the area. 
The staircase behavior eventually disappears in the large area limit. 

The framework of precise number counting also predicts the subleading correc- 
tion to the Bekenstein—Hawking law. For various model settings, the subleading 
correction generically takes the form akg In A/ Es where the coefficient a1 is inde- 
pendent of the value of y but differs for different models. The logarithmic correction 
is qualitatively in agreement with those obtained by different approaches (includ- 
ing those based on asymptotic symmetries, horizon symmetries, and certain string 
theories), despite very different physical assumptions. There are some indications 
that even the coefficient a, might be universal, up to differences on the treatment 


of angular momentum and conserved charges.!? 
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Recently, an explicit SU(2) formulation for the black hole entropy has been 
developed based on covariant Hamiltonian methods.!?? !?° This formulation avoids 
the partial gauge fixing of the standard approach and gives rigorous support for the 
earlier proposal that the quantum black hole degrees of freedom could be described 
by an SU(2) Chern—Simons theory. 


11. Loop Quantum Cosmology 


Loop quantum cosmology (LQC) is a finite, symmetry-reduced model of LQG that 
applies principles of the full theory to cosmological settings. Thanks to the math- 
ematical simplifications, many obscure aspects (e.g. quantum dynamics and semi- 
classical physics) in the full theory become transparent in LQC. The framework of 
LQC provides a “bottom-up” approach to the full theory, in which many ideas of 
LQG can be explicitly implemented and tested. 

The distinguishing feature of LQC is that the quantum geometry of LQG gives 
rise to a brand new quantum force that is inappreciable at low spacetime cur- 
vature but rises very rapidly and opposes the classical gravitational force in the 
Planck regime. As a consequence, for a variety of models of LQC, the cosmological 
singularity (big bang, big crunch, big rip, etc.) is avoided by the opposing force, 
therefore affirming the long-held conviction that singularities in GR signal a break- 
down of the classical theory and should be resolved by the quantum effects of 
gravity. Particularly, the big bang singularity is replaced by a quantum bounce, 
which bridges the present expanding universe with a preexistent contracting uni- 
verse. The new cosmological scenario suggests a change of the paradigm in the 
standard big-bang cosmology. 

In what follows, to illustrate the basic ideas of LQC, we consider the simplest 
setting — LQC in the k = 0 Friedmann—Lemaitre—Robertson—Walker (FLRW) 
model and particularly follow the lines of Ref. 126. See Refs. 18-21 for more detailed 
construction and more models of LQC. 


11.1. Symmetry reduction 


The first step is to impose symmetries of cosmology — spatial homogeneity and 
oftentimes also isotropy — on the phase space variables before quantization. Sym- 
metry reduction deliberately ignores infinitely many degrees of freedom and thus 
brings up the question whether the results from the full theory of LQG, if can 
be obtained after all, should resemble any predictions by LQC. The arguments in 
Sec. 1.2 of Ref. 19 suggest that the answer is likely to be affirmative, provided that 
the framework of LQC captures the essential features of the full theory. 

In the k = 0 FLRW model, both homogeneity and isotropy are assumed and 
the spacetime metric in the comoving coordinates is given by 


dr? = gyyxdax” = —N(t)dt? + a(t)?da’?. (171) 
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With the imposition of homogeneity and isotropy, the fundamental fields A’,(a) and 
E% (zx) are reduced to two variables c and p 


A‘ (x) =, E* (a) => p, (172) 


which are independent of x because of homogeneity and satisfy the canonical rela- 
tion 


8 
{ep} = grGy, (173) 


which is the symmetry-reduced counterpart of Eq. (17). As E% is related to “area” 
in the sense of Eq. (73), the symmetry-reduced variable p represents the area of the 
surfaces of a finite-sized cubic cell V (which is chosen to make sense of the spatial 
integral f;, d°a — f\, d°x). More precisely 


onal, (174) 


where a is the scalar factor in Eq. (171) and L is the coordinate length of the edges 
of V. 

Under the symmetry reduction, the Gauss constraint is trivially satisfied, and 
the diffeomorphism constraint is no longer an issue as the diffeomorphism invariance 
is gauge fixed in accord with the homogeneity. Only the scalar constraint remains 
significant. In the k = 0 FLRW setting, we have F = dA+AAA= AAA and 
K = (A-T)/y = A/y, which lead to F = AA A = 77K A K, or equivalently 
en. re = 2At Ai = 29? Ki Ki. Correspondingly, Eq. (22) becomes 


~a ~b ~a ~b 
N EE. .. N E.E. oe 
a ————— | @a—_— 4, Fe = -—a | d'x—— 2A) Al, (175 
[N] 87rGy? Jy Jal 8rGy? Jy / lal 4 ) 
where N = N(t) is independent of x, and the spatial integral |, d°z is restricted to 
Lye = hr dxy i dag i. dz3 to make C[N] finite. By Eq. (172), it follows from 


Eq. (175) that the gravitational part of the scalar constraint in terms of c and p is 
given by 


6N p? 
81Gy? \/p3 
where the lapse function N(a,t) = N(t) is chosen to be homogeneous. 

In the full theory, instead of A(x) and E(a), the holonomy h., and the electric 
flux over a surface Ks, ¢ are used as the fundamental variables for loop quantization. 
In the symmetry-reduced theory, analogous to Eq. (29), the variable c is replaced 
by the “holonomy” of c, defined as 


Ne = eer (177) 


Cerav(N) = — ce (176) 


for an arbitrary « € R. On the other hand, corresponding to Eq. (30), p remains 
the good variable, as it is redundant to specify S and f(a) in the homogeneous and 
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Table 3. Symmetry reduction in the k = 0 FLRW model. 


In the full theory In the k = 0 FLRW model 
At (x) c 
Ej (2) P 
: ~b : 
{Aj (2), E; (y)} = 8&9Gy SOL (a = y) {c, p} = 8nGy 
CoD] = 
Coie [N] — 
C[N] in Eq. (22) Cerav(N) in Eq. (176) 
holonomy: hy in Eq. (29) Ny = ett’/2, weR 
electric flux: Eg - in Eq. (30) Pp 
loop algebra: {hy, Es ¢} =... {Nu P} = iS uM, 


isotropic setting. The variables NV, and p form the closed “loop algebra” 


ieee am 


The symmetry reduction into the k = 0 FLRW model is summarized in Table 3. 


pN, end {Na,Nu} =0, (178) 


11.2. Quantum kinematics 


Analogous to the linear space Cy] of cylindrical functions of A, i.e. Up, [A], defined 
in Eq. (36) in the full theory, we begin with the linear space Cyl> of almost periodic 
functions of c defined as 


= Da beNi (c)= Df etnne/2 (179) 


where k runs over a finite number of integers, wz € R, and & € C. Analogous to 
the inner product of Cyl defined in Eqs. (37) and (38), the inner product of Cyl® 
is defined via 


(Mur | Niu2) = (ua | H2) = Oppo (180) 


where the right-hand side is the Kronecker delta, but 41, Ww2 € R take continuous 
values. This inner product is very different from that defined via (11 | 42) = 6(f41 — 
12) with the Dirac delta function. To highlight this nuance, Cyl® is said to be in 
the “polymer representation” of ju. 

The (gravitational part) of the kinematical Hilbert space H®,, is the Cauchy 
completion of Cyl® with respect to the inner product Eq. (180). That is, Ts = 
L?(Rponr, dor), where Rgonr is the Bohr compactification of R and dygonr is the 
corresponding measure. An orthonormal basis of To is given by {N,| u € R}. By 
Dirac’s bra-ket notation, the map c++ e#°/? is denoted by |) with 


e'? = (ely), (181) 
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and consequently we have 


I) = >In) | ¥) = > Ww). (182) 


Analogous to the holonomy, area, and volume operators defined in Sec. 5, we 
have corresponding operators acting on H§,,, 


N,W(c) = e'F U(c) := e'F V0), (183) 
BU(c) = mm A, (184) 
VU(c) = |p)? Vo), (185) 


where fp; = VGhi is the Planck length. When acting on the basis state |), these 
operator behave as 


Nucla) =F |) = w+ w!), (186) 
x 81y 
b\w) = = Sele), (187) 
; ae ee 
Pa = Valeo = (FE al) eal (188) 


The correspondence for quantum kinematics between the full theory and the 
symmetry-reduced theory is listed in Table 4. 


Table 4. Quantum kinematics of LQG and LQC in the k = 0 FLRW model. 


LQG LQC in the k = 0 FLRW model 

Cyl Cyl§ 
cylindrical functions: almost periodic functions: 

Vr, [A] in Eq. (36) U(c) = Dy Seu, = Dn Creer’? 
inner product: inner product: 

(Wr, ¢ | Yr,g’) in Eqs. (37), (38) (Nur |Nuo) = (ui | M2) = Suine 
H = L?[A, dwar] HE,, = L?(Reonr, d/Bonr) 
IT, jr, 21, G1) in Eqs. (39), (40) 1) 
spin networks |’, 77, in) |) (Gauss constraint trivial) 
s-knots (K, c| |) (no diffeomorphism constraint) 


holonomy operator: hy in Eqs. (63), (64) Ni |b) = eiv'c/2/4) = [w+ p') 
area operator: E(S) in Eq. (75) blu) = Me £2, bl L) 


z & 3/2 
volume operator: V(R) in Eq. (83) Vi[y) = (22 [u!) ilu) = Valu) 
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11.3. Quantum constraint operator 


Just as in the full theory, we have to regularize the scalar constraint before quan- 
tization. In Eq. (175), the nonpolynomial factor 1/ Jal is dealt with Thiemann’s 
trick and Fy, is replaced by a holonomy long a square loop O. Corresponding to 
Eqs. (100) and (101) (with RG = RO/?), CT = 1), it turns out Cgray with N = 1 
can be written as 


Cra = jim 1 Cfdvs (189a) 
4sgn(p) 7 (A) p00) (,)\ 27,2, A) £7, -1 
B igkm, 
COM. = - SS eM Tr hy hy (hg) (a) hg (hE) VP) 
814772 G a 
4 sgn(p (A) f7,))-1 
= sin de ~Feig LR eA {() VS) sin re, (189b) 
where 
4 d 
no = cos > + 27; sin > (190) 


is the holonomy along the edge of 0, which is in the x direction and of coordinate 
length AL, and \ plays the role of € in Eq. (100) (more precisely, AL = €). 

It is straightforward to obtain the quantum operator eae associated with o. 
but the limit A — 0 is ill-defined. Unlike the full theory of LQG, the dependence 
of Ce on the regulator A does not go away because the diffeomorphism has been 
gauge fixed. In order to inherit the spatial discreteness from LQG appropriately, 
we take the prescription by hand to shrink the square loop O to a finite area equal 
to the area gap A, i.e. the smallest nonzero eigenvalue of area, given by Eq. (76). 
Consequently, we are led to choose for X a specific function 4 = fi(p) given by 


Area of 0 = a?(\L)? = ji? |p| = A, (191) 


or equivalently 


A vV3v3 1 


f= : (192) 
Pl V2 
Equation (186) suggests that, in a heuristical sense, 
FW() = eM U(p), (193) 


even though the differential operator ta is ill-defined in L?(Rponr, diponr). Let us 
define the affine parameter 


2/2 


v= K sen()|u|°/”, with K = 
3V3 


(194) 


WAs this is only a phenomenological prescription, different numerical values for A of the same 
order of magnitude are also used in the literature. 
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such that formally 
pj— = —. (195) 


In the v-representation, the action of e’2 can then be defined as 


= Uy )=WV(v+1), (196) 
and correspondingly Eq. (188) leads to 


> Any we lvl 3 
Vie) = s x file): (197) 


As e’'> and V are now well defined, Of, in Eq. (189b) can be promoted to 
the quantum operator 


Cas = sin(fic) Ea (sin & V cos C cos 7 V sin e)| sin(jic), (198) 


where we have chosen the symmetric ordering to make so hermitian. Its action 
on W(v) is given by 


CoravU(v) = fy(v)U(v + 4) + fo(v)U(v) + f-(v)W(v — 4), (199) 
where 
fi(v) = al = sare + 2i|lv +1] — |v +3)], (200a) 
f-() = fav -4), (200b) 
fo(v) = —f+(v) — f-(v). (200c) 


Meanwhile, to make the quantum dynamics more transparent, we add a free 
massless scalar ¢(a,t) = ¢(t) to serve as the internal time. At the classical level, 
the matter part of the scalar constraint is given by 


Craik = = 87G|p|— 3/2, i (201) 
where 
ps = BLS (202) 


is the momentum of ¢, and db is the derivative with respect to the proper time. 
The matter part of the kinematical Hilbert space is given by the ordinary construc- 
tion L?(R,dé) without any “holonomization.” However, the inverse triad factor 
lp|-°/ still has to be regularized by Thiemann’s trick as for 1 if lg] tt Coon, Ub 


to quantization ambiguities, the quantum operator for Ip|-?/ ? turns out to be 


—~ 3/2 
pr 0)= (Gar) BOWE, (208) 
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where 
B(v) = (3) lly +19 = oa)", (204) 
Consequently, the total quantum constraint equation is given by 
CW(v, $) = (Cgrav + Cmatt)Y(v, $) = 0, (205) 
which is explicitly expressed as 


O5U(v, 9) = B(v)~* (Cy (v)U(v + 4,9) + Co(v)¥(v, d) + C_(v)¥(u — 4, 4)) 


= HOG @), (206) 
where 
Cty ance lv + 2||lv + 1] —|o-+3ll, (207a) 
on one (207b) 
Ci) =O — CG). (207) 


11.4. Physical Hilbert space 


The form of the constraint equation (206) suggests that it is natural to regard ¢ 
as the internal time, with respect to which U(v) evolves. To implement this idea, 
an appropriate kinematical Hilbert space for both geometry and the scalar field 
is chosen to be Hkin = L?(Rponr, B(v)dvponr) ® L?(R,d¢@). It is straightforward 
to show that both the operators pg = —ihOg and © are hermitian in Hyin, and 
furthermore 0 is positive definite. 

The operator © is a second-order difference (cf. differential) operator. Conse- 
quently, the space of physical states (i.e. the space of solutions to the constraint 
equation) is naturally divided into sectors each of which is preserved by the “evolu- 
tion.” Denote by £, the “lattice” of points {le] + 4n;n € Z}U{—|e] +4n;n € Z} in 
the v-axis and denote by H8"®” the subspace of L?(Rponr, B(v)dvponr) with states 
whose support is restricted to lattice £,. Each of H8"*Y is mapped to itself by O. 

Let e,(v) in H8*” be the eigenstate of O 


Ge, (v) = w(k)%ex(v), (208) 


where w(k)? is the corresponding eigenvalue. The general solution to the quantum 
constraint Eq. (206) with initial data in H8"*’ is then given by 


U(v, d) = i 7 dk (U4 (k)ex(v)e? + U_(k)e,(v)*e?), (209) 


—oo 


where Wi(k) are in L?(R,dk). The positive/negative-frequency solutions satisfy a 


Schrédinger type first order differential equation in ¢: 


OVE 
are VOW,. 


(210) 
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Therefore, the positive/negative-frequency solutions with the initial state V(v, do) 
are given by 


Wa(v, $) = et?¥9 (-40) Hu, do). (211) 


The positive-frequency and negative-frequency solutions can be discussed sepa- 
rately, and we focus on the former ones in what follows. 

The sector of the physical Hilbert space Hohys labeled by € € (0, 2] consists of 
positive-frequency solutions U(v,¢) to Eq. (210) with initial data U(v,¢9) in the 
symmetric sector of 7(2°Y.* The physical inner product is defined as 


(Wi | Vo)onys = y, B(v)V1(v, d0)*Vo(v, do). (212) 
ve {+le|+4n;neEZ} 


It is easy to show that this definition is independent of the choice of ¢o9 as long as 
WV 2(v, ¢) are solutions to Eq. (210). 

Dirac operators acting on Hphys are pg and a family of operators lel 
parametrized by ¢, defined as 


pov, d) = =n ee), (213) 
Jul g¥(v, d) = e&¥© F-9) ju] W(w, do). (214) 


Both pg and lel preserve the sector 7(5),,, and are hermitian with respect to 

(-|-)bnys: The expectation value (U| 54|V) pnys gives the momentum of ¢, which is a 

constant of motion, i.e. independent of ¢. On the other hand, the expectation value 

W\lv|,.|Y)pnys tells how big the volume of V is when the internal time takes the 
o|* / phy 


value ¢, in the style of Eq. (123). It should be noted that the definition of lel, in 
Eq. (214) is independent of the choice of ¢o. 


11.5. Quantum dynamics 


The eigenvalue problem Eq. (208) can be numerically solved (by a computer), 
and the eigenfunctions e,(v) can be used to build coherent states. Particularly, 
in Eq. (209), a natural choice is to set U_(k) =0 and 


Wi (k) =e" ae ete", (215) 
with a suitable small spread a, to obtain a coherent physical state Vpsv5, (v, d). 
The resulting state is peaked at (Ups.03, | Po|Vpx v3, )phys = Pj = —V127G hk* and 


at (Ups v3 | oly, Mox.0%, )phys = U%,, Where vy, is determined by ¢”*. 
To give a coherent state that is semiclassical at late times, we have to choose 
large values for p% (i.e. k* < —1) and for vj, (i.e. vj, > 1). Given such a coherent 


*The sign flip v — —v corresponds to reverse of the triad orientation. Because of no parity 
violating processes, we are led to choose the symmetric sector in which U(—v) = V(v). 
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physical state, the evolution with respect to the internal time @ can be read out 
from the expectation value 
(Us,03, | vl | Ups,vs )phys 


(v(@)) = oo (216) 


(Wps,u5, | Yos.v3,) 


phys 

The numerical results show that the trajectory of (v(¢)) follows the classical solution 
at large scale but at the Planck regime undergoes the quantum bounce, which bridges 
the expanding classical solution with the contracting one (see Fig. 4 in Ref. 126). 
The quantum bounce takes place when the energy density pg of the scalar field 
approaches the critical density poit & 3/(8mGy?A) = V3/(167?7°G?h) of the 
Planck scale. More precisely, the energy density operator is defined as 


a 2 a 

a Po kK 3 Py 
— = 217 
Palo sips = “5 Canaan) Fa alk 


and its expectation value 


(Ups 05, | PolelUps.os, )phys 


(po($)) = 
(Up3 05, | ce ae 


_ Kf 3 \* 
«= (gam) TOR oe 


is bounded above by perit (see Fig. 6 in Ref. 126). 

If the internal time ¢ is expressed in terms of the proper time t via a°L°dé = 
(pg)dt = pdt in accord with Eq. (202),” the trajectory of (v(@)) in relation to 
(p4(@)) can be described rather accurately by the effective equation for the scalar 


factor a(t) 
wy 
a 87G Po 
= 1 219 
(<) 3 me ( Perit ) ; ( ) 


which is the Friedmann equation modified by the opposing quantum force. 


11.6. Other models 


A slight simplification in the model discussed above leads to the “solvable LQC” 

(sLQC).!?" In sLQC, analytic solutions are available and it is shown that the quan- 

tum bounce is generic not only for coherent states, but for any arbitrary physical 

states and the matter density has an absolute upper bound given by 
3 

it = ——S 220 

ans 8nGy7A en) 


YThe change of variables from ¢ to t makes sense only for sharply peaked coherent states. In the 
quantum theory, the internal time is more fundamental than the proper time, which is a derived 
notion only meaningful for sharply peaked coherent states. 
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The construction of LQC has been extended to a variety of models with various 
degrees of rigor, including k = +1 (Ref. 128) and k = —1 (Ref. 129) FLRW models, 
possibly with a nonzero cosmological constant,!20 !82 Bianchi I,1°3-'87 Bianchi II!°8 
and Bianchi IX!°° models, Kantowski-Sachs model,'4°4! unimodular model,!4? 
higher-order holonomy extension,!4*:!44 and much more. The bouncing scenario 
has been shown to be robust in different models. 

Particularly, the formalism of LQC in the Kantowski-Sachs model can be used 
to study the loop quantum geometry of the interior of a Schwarzschild black 
hole.'4°-149 The study suggests that, just like resolution of the cosmological sin- 
gularity, the classical black hole singularity is also resolved by the loop quantum 
effects and replaced by the quantum bounce, which either bridges the black hole 
interior to a white hole interior or gives birth to a baby black hole.'49:7 

Furthermore, inhomogeneity has also been taken into consideration in the 
Gowdy model (the simplest of the inhomogeneous models)!°? 1° as well as in the 
framework of “lattice LQC” .15%:157 


12. Current Directions and Open Issues 


In this review article, we gave a self-contained introduction to the canonical for- 
mulation of LQG and briefly covered the ideas of the spin foam theory, black hole 
thermodynamics, and LQC. The core formalism of LQG provides a coherent frame- 
work in which the fundamental principles of GR and QFT consort with each other 
in harmony without invoking additional hypotheses. The key result of LQG is a 
compelling physical picture of quantum space, which is made up of quantized areas 
and volumes of the Planck scale in a profoundly background-independent fashion. 
Despite many remarkable achievements, the theory of LQG is still far from com- 
plete, especially for the aspects of quantum dynamics and low-energy limit. A wide 
range of research has been developed for the open problems. We conclude this 
review by outlining a nonexhaustive list of recent advances that are not covered in 
the main text and also addressing some open issues along the list. One can easily 
appreciate that LQG has grown into a vast and active area of research along many 
various directions. 


12.1. The master constraint program 


The scalar constraint remains the major unsolved problem in LQG as noted in 
Sec. 6. The master constraint program! (also see Sec. 10.6 of Ref. 10 for a system- 
atic account) proposed an elegant solution to the difficulties concerning the algebra 


*Validity of this approach however might be questionable, since the Schwarzschild interior is 
extensible beyond its boundary (event horizon) and thus cannot be considered as a self-contained 
universe (i.e. it is not a genuine Kantowski-Sachs spacetime). The more rigorous approach is to 
consider the black hole interior and exterior as a whole in the framework of spherically symmetric 
midisuperspaces (see Sec. 12.10). 
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of commutators among smeared scalar constraints in terms of the so-called master 
constraint, which combines the smeared scalar constraints C[N] := J, d?aN(«)C(«) 
for all smearing functions N (2) into a single constraint 


OC 2 
M = | Par (2) , 
= vid 
where the factor 1,/|q| has been incorporated to make the integrand a scalar density 


of weight 1 so that M is diffeomorphism invariant, i.e. {Cpig[N], M} = 0. It is clear 
that M = 0 implies the infinitely many constraints C(a) = 0 for Vz € ¥ and vice 


(221) 


versa. 

The single constraint M in place of C[N] greatly reduces the number of con- 
straints and drastically simplifies the Poisson bracket structures among constraints: 
Eq. (24e) is replaced by {Cpia[N], M} = 0, and more notably the complicated struc- 
ture of Eq. (24f) is now reduced to the trivial algebra {M,M} = 0. As the master 
constraint renders the constraint algebra as a genuine Lie algebra, the program 
opens a new avenue for understanding the quantum dynamics and establishing the 
semiclassical limit. 

The master constraint program has various advantages in an attempt to define 
the physical inner product and formulate rigorous path integrals, '°9:!®° 
to be more directly related to the spin foam formalism than the standard formula- 
tion of LQG. 


and it seems 


12.2. Algebraic quantum gravity 


The master constraint program has evolved into a fully combinatorial theory known 
as algebraic quantum gravity (AQG)!*! 1% (also see Sec. 10.6.5 of Ref. 10). AQG 
is closely related to yet very different from LQG by the fact that no topology 
or differential structure of space is assumed a priori. In this sense, background 
independence of AQG is even more compelling, and it is possible to talk about 
topology change. A viable semiclassical machinery that involves only nongraph- 
changing operators has been suggested, and considerable progress has been made 
in providing contact with the low-energy physics. 


12.3. Reduced phase space quantization 


There are two major approaches to the canonical quantization of a field theory with 
gauge symmetries: the so-called “Dirac approach” and the “reduced phase space 
approach.” In the Dirac approach, one first construct the Hilbert space that repre- 
sents the partial (gauge-variant) observables and then impose the constraints at the 
quantum level to select the physical (gauge-invariant) states. In the reduced phase 
space approach, one first constructs the physical (gauge-invariant) observables at 
the classical level and then directly construct the Hilbert space that represents 
the physical observables. (Of course, it is possible to take a “mixed” approach by 
implementing different strategies for different gauge symmetries.) 
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The advantage of the Dirac approach is that the algebra of partial observables 
is usually simple enough and thus the (kinematical) Hilbert space is easy to con- 
struct, but one has to deal with spurious degrees of freedom, which are the possible 
source of quantization ambiguities and quantum anomalies. On the other hand, the 
advantage of the reduced phase space approach is that one never has to care about 
the kinematical Hilbert space at all, but the induced algebra of physical observ- 
ables is typically so complicated that the corresponding (physical) Hilbert space is 
extremely difficult to find. 

The standard formulation of LQG adopts the Dirac approach, as constructing 
the reduced phase space of GR with standard matter is uncontrollably nontrivial. 
However, there is a hope of obtaining the reduced phase space with a manageable 
induced algebra, if one adds pressure-free dust!® or a massless scalar field’® to the 
theory. This is essentially because, at the price of introducing additional matter, 
the constraints can be deparametrized with respect to the dust or the scalar field, 
which serves as a material reference system. 

Schematically (consider only the scalar constraint), when the system is 
deparametrized, one can find a partial variable T serving as an “internal clock” such 
that the local scalar constraint (at least locally) reads as C[q*, pa] = P + hlq*, pal, 
where P is the momentum conjugate to T and both T and P are independent of 
other conjugate pairs (q"(x),pa(a)). One can identify a one-parameter family of 
the physical observables Or associated with the partial operator O such that the 
map F, : Or Or, for any 7 is a homomorphism between the Poisson bracket of 
partial observables and a certain Dirac bracket (uniquely determined by the con- 
straints and the choice of T’) of physical observables. This implies that the reduced 
phase space approach is achievable, as finding the Hilbert space representation of 
Or is as easy as that of O. In the resulting Hilbert space, the physical observable 
Or, is interpreted as the observable O measured when the internal clock T’ takes 
the value 7, and (as heuristically P acts as —i0/0T) the dynamics is generated by 
the evolution equation 


6, = -i[A,O;], (222) 
where 
A= fac fla"(),pal2) (223) 


is called the physical Hamiltonian in contrast to the Hamiltonian constraint as 
it corresponds to a nonvanishing physical observable that generates the genuine 
“physical” evolution. 

Based on the model of Ref. 165 with pressure-free dust, the reduced phase 
space approach to a quantum theory of GR has been formulated in the styles of 
LQG167,168 as well as of the Master constraint program and AQG.'® Based on the 
model of Ref. 166 with a massless scalar field, the reduced phase space approach 
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has also been constructed in the style of LQG.'® These studies have led to a new 
appealing direction of QG alongside the standard LQG. 


12.4. Off-shell closure of quantum constraints 


In the Dirac approach of canonical quantization, quantizing a constrained system 
faithfully, one would expect that the corresponding quantum operators represent 
the same structure of the classical constraint algebra. That is, in the case of gravity, 
in accordance with the constraint algebra given by Eq. (24), there exists a vector 
space V of spin networks upon which, for all |W) € V, the quantum operators act 
as 


Uns Uy,|V) = Carol Os (224a) 
Th CIN] U,|¥) = Cly*N]|¥), (224b) 
[CIN], C[M]]|©) = Cis [q*?(NO,M — MAN)]|¥), (224c) 


where we have disregarded the Gauss constraint Cg = 0, which is trivially satis- 
fied via intertwiners. This, however, is not the case in the standard formulation 
of LQG. In the diffeomorphism covariant regularization scheme as addressed in 
Sec. 6.2, Eq. (224a) is trivially satisfied and Eq. (224b) corresponds to Eq. (112), 
but Eq. (224c) on the other hand is replaced by [see Eq. (113)] 


((CLN], CIM] n(¥)|®) := lim (n(¥)| [Cr [N], Cr, [Ml] ®) = 0, (225) 


for all ® € V, where 77 is the linear map in Eq. (58). In this sense, the standard 
LQG is said to be anomaly-free only at the on-shell level, but not off-shell. 
Although there is no obvious self-inconsistency, the fact that the quantum alge- 
bra is not off-shell closed arouses various worries. It was argued in Refs. 170 and 171 
that quantum anomaly is hidden on-shell simply because the local scalar constraint 
C in Eq. (16c) is specifically of density weight 1. To see this heuristically, consider 
the local scalar constraint C) ~ jg PF Pee -- of density weight & associated 
with the smearing function N“@—*) of density weight 1 — k. The scalar constraint 
in d spatial dimensions (d > 2) is given by C[N] = f,d¢aN@-*)(z)C) (x) and 
scaling analysis tells that the regulated classical scalar constraint Cr,{N] scales as 
ox e4e—24(K—2)/2 ¢—2¢—(d-1) (4-1) — e4(1-k) | Tn the special case of k = 1, there is no 
explicit dependence on € and consequently the right-hand side of Eq. (225) is trivial- 
ized (as long as € becomes sufficiently small). In a sense, the quantum operator Cc [N] 


implemented in the standard approach fails to grasp the geometrical significance 
dictated by Eq. (224c) and hence the quantum theory might not faithfully corre- 
spond to the starting classical theory (however, see Ref. 172 for a counterargument). 

If one choose to construct the scalar constraint operator by starting with C\” 
of higher density weight, one would has enough scaling factors of e~! to match 
the scaling of Cpig[S] on the right-hand side of Eq. (224c) (as Cpig[S] involves 
spatial derivatives via NO,M — M0,N, which gives rises to at least one factor of 
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e—!) and be able to obtain an off-shell quantum representation for the constraint 
algebra. This “off-shell closure” approach indeed can be rigorously formulated for 
a generally covariant U(1)° gauge theory, which can be understood as the G > 0 
limit of Euclidean gravity.17? 1” For Euclidean LQG, the new study of Ref. 176 
presented some evidence that, working with k = 4/3, there exists such a scalar 
constraint operator well defined on a vector space VY, which turns out to be a 
suitable generalization of the Lewandowski—Marolf habitat.17+ 

In the off-shell closure approach, the geometrical interpretation of the action 
of the scalar constraint operator becomes more transparent, which should help 
to elucidate the quantum dynamics and the low-energy physics. Furthermore, as 
the requirement of being anomaly-free at the off-shell level is supposed to impose 
more restrictions on permissible regulators, the infinite multitude of quantization 
ambiguities could be enormously reduced or even uniquely fixed. 


12.5. Loop quantum gravity versus spin foam theory 


The canonical (Hamiltonian) formalism of LQG and the covariant (sum-over- 
histories) formalism of the spin foam theory are closely related to each other, yet up 
to now spin foam models have not been systematically derived from the standard 
canonical theory of LQG. Over the past years, the spin foam theory in relation to 
the kinematics of LQG have been clearly established.9°1”7 !®! For the dynamics, 
on the other hand, the two formalisms bear close resemblance to each other but 
it remains a challenging open problem to clearly derive the relation in the four- 
dimensional theory. (In the three-dimensional theory, a clear-cut relation between 
the canonical quantization of three-dimensional gravity and spin foam models has 
been nicely established.!8?) 


12.6. Covariant loop quantum gravity 


Even though the precise connection between LQG and the spin foam theory is 
still not completely clear, the merger of the canonical and covariant approaches 
has suggested a rather well-established theory formulated in a covariant formal- 
ism known as “covariant loop quantum gravity,” which provides us a brand new 
perspective on LQG183 185 ( 
various quantization procedures (canonical, spin foam, etc.) all lead to the over- 


also see Ref. 13 for a comprehensive account). Since 


lapped results, perhaps it is the time to “leave the ladder behind” (as advocated 
in Ref. 183) and take the merged theory of covariant LQG seriously as the defining 
quantum theory of gravity instead of “deriving” it by quantizing the classical GR. 
The kinematics of covariant LQG is well defined and background independent, and 
the dynamics of it is given in terms of a simple vertex function, largely determined 
by locality, diffeomorphism invariance, and local Lorentz invariance. 

In the framework of covariant LQG, transition amplitudes for given boundary 
states can be computed explicitly and then compared with the classical theory by 
the techniques of holomorphic coherent states (see Sec. 8.3 and references therein). 
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Particularly, the two-point function of the Euclidean theory over a flat spacetime 
has been computed to the first order in the vertex expansion, and has been shown 
to converge to the free graviton propagator of QG in the large distance limit.°! The 
n-point functions of the Lorentzian theory have yet to be computed and compared 
with the vertex amplitude of conventional perturbative QG on Minkowski space. 


12.7. Spin foam cosmology 


In the framework of covariant LQG, by choosing the holomorphic coherent states 
of geometry peaked on homogeneous isotropic metrics, it is possible to compute 
the transition amplitude in the vertex and graph expansions for the evolution of a 
homogenous isotropic universe. The calculation has been performed and the result- 
ing amplitude in the classical limit appears to be consistent with the FLRW evolu- 
tion.!8° Furthermore, if a cosmological constant term is added, the result matches 
the de Sitter solution in the classical limit.!5” 

This approach starting from the full theory of covariant LQG and then taking 
an approximation of the vertex and graph expansions is called “spin foam cosmol- 
ogy.” By comparison, in LQC one studies the exact solutions in a symmetry-reduced 
quantum theory, while in spin foam cosmology one studies the approximate solu- 
tions in the full quantum theory. Casting LQC in an spin foam-like expansion has 
also been considered.18° "194 
of LQC can be regarded as two converging approaches to the sum-over-histories 
formalism of quantum cosmology. 


Spin foam cosmology and the spin foam-like expansion 


12.8. Quantum reduced loop gravity 


Quantum reduced loop gravity (QRLG) is a framework introduced for the quan- 
tum theory of a symmetry-reduced sector of GR, based on a projection from the 
kinematical Hilbert space of the full theory of LQG down to a subspace representing 
the proper arena for the symmetry-reduced sector. It was first proposed in Refs. 192 
and 193 for an inhomogeneous extension of the Bianchi I cosmological model. This 
approach provides a direct link between the full theory and its cosmological sector 
and also sheds light on the relation between LQC and LQG. 

It was later shown in Ref. 194 that the QRLG technique can be applied to 
broader cases beyond the cosmological context whenever the spatial metric can be 
gauge-fixed to a diagonal form. The technique projects the Hilbert space to the 
states based on the reduced graphs with Livine-Speziale coherent intertwiners.%° 
The framework of QRLG could simplify the analysis of the dynamics in the full 
theory and other issues, such as the coupling between quantum geometry and matter 
and the relation between canonical and covariant approaches. 


12.9. Cosmological perturbations in the Planck era 


Cosmological inflation is a popular paradigm in modern cosmology. The theory of 
inflation has successfully explained how quantum fluctuations sow the primordial 
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seeds that grow into the large-scale structure at late times of our universe and made 
a number of predictions that have been confirmed by a range of observations. In the 
standard inflationary scenario, cosmological perturbations are described by QFT 
on classical cosmological spacetimes, and thus the domain of validity excludes the 
Planck era. 

In a series of papers Refs. 195-197 (also see Ref. 198 for a nice summary), by 
studying the dynamics of quantum fields representing scalar and tensor perturba- 
tions on quantum cosmological spacetimes using techniques from LQG, the infla- 
tionary paradigm is extended to a self-consistent theory covering the epoch from the 
big bounce in the Planck era to the onset of slow-roll inflation. This pre-inflationary 
dynamics could yield deviations from the standard inflationary scenario and give 
rise to novel effects, such as a source for non-Gaussianity. These novel effects might 
catch a glimpse into the deep Planck regime of the early universe in future cosmo- 
logical observations. 


12.10. Spherically symmetric loop gravity 


LQC is the loop quantum theory of minisuperspaces in the sense that degrees of 
freedom are truncated to a finite number by the symmetry of homogeneity (and pos- 
sibly also isotropy). The next simplest symmetry-reduced framework is the spher- 
ically symmetric midisuperspace, in which the spherical symmetry is imposed but 
inhomogeneity along the radial direction is retained, thus still giving rise to a field- 
theoretical system of infinite degrees of freedom. Ashtekar’s formalism for spheri- 
cally symmetric midisuperspaces and its loop quantization have been studied and 
developed with various degrees of rigor.!99 2°? The theory of spherically symmetric 
loop gravity provides an arena for testing important issues that are too difficult 
in the full theory of LQG and trivialized in minisuperspace (LQC) models. Par- 
ticularly, the SU(2) internal gauge is reduced to U(1) and the three-dimensional 
diffeomorphism is reduced to one-dimensional diffeomorphism (in the radial direc- 
tion), making the constraint algebra much simpler yet nontrivial. The framework of 
spherically symmetric loop gravity can be used to study loop quantum geometry of 
spherically symmetric black holes (Schwarzschild black hole, spherical gravitational 
collapse, etc.). It is indicated that in the resulting quantum spacetime the black hole 
singularity is avoided,?°? ?°° suggesting that the information loss problem could be 
resolved accordingly. 


12.11. Planck stars and black hole fireworks 


One of the key insights obtained in LQC is that the quantum geometry of LQG 
gives rise to opposing quantum gravitational force, which becomes appreciable when 
the matter density comes close to the Planck scale density ~1/(G?h). Based on the 
insight, recently, a new possible scenario of collapsing black holes has been sug- 
gested’: The gravitational collapse of a star does not lead to a singularity but to 
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the quantum gravitational phase called a “Planck star,” where the very huge gravi- 
tational attraction is counterbalanced by the opposing quantum force. Accordingly, 
a black hole hides a core of a Planck star, which is of the Planck scale density but 
can be much larger than the Planck scale size (depending on the initial mass of the 
collapsing star). As the black hole evaporates, the core remembers the initial mass 
and the final explosion can occur at macroscopic scale, thus providing a possible 
mechanism for recovery of information loss. More interestingly, the objects of Planck 
stars could produce detectable signals of quantum gravitational origin.207:2°% 

Furthermore, the macroscopic remnant can develop into a white hole. As shown 
in arecent paper Ref. 209, there is indeed a classical metric that satisfies the Einstein 
field equation outside a finite spacetime region where matter collapses into a black 
hole and then emerges from a white hole. Therefore, a black hole can quantum- 
tunnel into a white hole in the similar fashion that the wave packet representing a 
collapsing universe tunnels into a wave packet representing an expanding universe in 
LQC. A distant observer thus sees a dimming star, after a very long time, reemerge 
and burst out matter — a phenomenon called “black hole fireworks.” Under the 
scenario of black hole fireworks, the arguments over the information paradox have 
to be drastically revised. 


12.12. Information loss problem 


It is widely expected that QG, once fully developed, should resolve several impor- 
tant problems in QG, particularly the singularity problem and the black hole infor- 
mation paradox. As both the cosmological and black hole singularities are indicated 
to be resolved by the LQG effects, it is strongly suggested that the information lost 
in the process of black hole evaporation should be recovered in a singularity-free 
scenario. In Ref. 210, it was analyzed in detail for two-dimensional black holes that 
quantum geometry effects indeed recover the information loss, primarily because 
the black hole singularity is resolved and consequently the quantum spacetime is 
sufficiently larger than the classical counterpart. For four-dimensional collapsing 
black holes, the aforementioned scenarios of Planck stars and black hole fireworks 
in particular suggest a similar mechanism for information recovery. 

The model of Ref. 210 provides a concrete example of avoiding the information 
loss problem under the “remnant scenario,” in which black hole evaporation stops 
at some point, leaving a black hole remnant that is correlated with the Hawking 
radiation and allowing the combined state to remain pure. The remnant scenario 
generally implies the existence of a very long-lived remnant, which, unfortunately, 
may cause other problems (e.g. the problem of infinite remnant production?"). 
212 investigated an example from LQG in which the 
singularity problem is solved but the information loss problem is made worse. It 


Meanwhile, a recent paper 


was argued that the aggravated information loss problem is likely to be a generic 
feature of LQG, therefore putting considerable conceptual pressure on the theory 
of LQG. 
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The information loss problem has been around for a long time. There is still no 
consensus over how the problem is to be resolved and it remains one of the focal 
points in the research of QG. 


12.13. Quantum gravity phenomenology 


A legitimate quantum theory of gravity must make unambiguous predictions that 
in principle can be empirically tested by experiments or observations. Contrary to 
popular opinion, LQG does make definite predictions. For example, any measured 
physical area (such as the total cross-section of a scattering process) must be given 
by the discrete spectrum of area. (Also see Ref. 213 for the possible effects of angle 
quantization on scattering.) The true problem is that these predictions demand 
experiments or observations probing the Planck scale, which is believed to be far 
out of current reach. The impasse could be overcome soon, as the Planckian physics 
may not be completely unreachable after all by present and near-future technology. 

As mentioned earlier, loop quantum effects in the pre-inflationary era could give 
rise to sufficient deviations from the standard inflationary scenario. Some research 
works have attempted to reveal possible observable footprints of these effects on 
the cosmic microwave background along the lines of Refs. 214-218. 

Also as noted previously, the existence of Planck stars could produce detectable 
signals. The possible phenomenological consequences of Planck stars in gamma-ray 
bursts were studied in Ref. 208. 

Additionally, while the Lorentz invariance can be made manifest both in LQG 
and in the spin foam theory (see Ref. 219 and references therein), many other 
models suggest violation of Lorentz invariance in the deep Planck regime, which 
will modify the Lorentzian energy-momentum dispersion relation at lower-energy 
scale. Observations of ultra-high-energy cosmic rays — the most energetic particles 
ever observed — have thus far put strong constraints on deviations from the 
Lorentzian dispersion relation. Future investigation with improved precision will 
either impose even stronger constraints or unveil breakdown of the Lorentz invari- 
ance, thus enabling theorists to discriminate between different models of LQG and 
the spin foam theory. See Refs. 220 and 221 for more discussions. 


12.14. Supersymmetry and other dimensions 


It is a celebrated virtue that LQG does not require any hypothetical ingredients, yet 
it has always been desirable to incorporate supersymmetry and extra dimensions at 
one’s disposal for the sake of both theoretical interest and comparison with other 
QG approaches (string theory in particular). Early attempts can be found e.g. 
in Refs. 25-27. Recently, a series of papers??? 72° have been devoted to rigorous 
formulation of the loop quantum theory for supergravity and for all dimensions. 
Some progress has been made toward calculating black hole entropy from loop 


230,231 


quantum theory in higher dimensions. It would be very instructive to compare 


these results to those obtained by string theory inspired approaches. 
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